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PREFACE 


This volume is the first part of a work designed to provide a 
convenient account of the foundations and methods of modem 
algebraic geometry. Since nearly every topic of algebraic geometry 
has some claim for inclusion it has been necessary, in order to keep 
the size of this volume within reasonable limits, to confine ourselves 
strictly to general methods, and to stop short of any detailed 
development of geometrical properties. 

We have thought it desirable to begin with a section devoted to 
pure algebra, since the necessary algebraic topics are not easily 
accessible in English texts. After a preliminary chapter on the basic 
notions of algebra, we develop the theory of matrices. Some novelty 
has been given to this work by the fact that the ground field is not 
assumed to be commutative. The more general results obtained are 
used in Chapters V and VI to analyse the concepts on which pro- 
jective geometry is based. Chapters III and IV, which will be 
required in a later volume, are devoted to a study of algebraic 
equations. 

Book II is concerned with the definition and basic properties of 
projective space of n dimensions. Both the algebraic and the 
synthetic definitions are discussed, and the theory of matrices over 
a non-commutative field is used to show that a space based on the 
propositions of incidence can be represented by coordinates, with- 
out the introduction of any assumption equivalent to Pappus’ 
theorem. The necessity of considering a large number of special 
cases has made Chapter VI rather long, but some space has been 
saved in the later parts of the chapter by merely mentioning the 
special cases and leaving the proofs to the reader, when they 
are sufficiently simple. It is hoped that this will not cause any 
difficulty. This Book concludes with a purely algebraic account of 
collineations and correlations. Certain elementary geometrical 
consequences are indicated, but a complete study of the geometrical 
problems involved would have taken us beyond our present 
objective. 

It is hoped that Volume II will appear shortly. This will be 
devoted to the theory of algebraic varieties, and to the study of 
certain loci which arise in many geometrical problems. 
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BOOK I 

ALGEBRAIC PRELIMINARIES 


CHAPTER I 

RINGS AND FIELDS 

The reader is assumed to be familiar with the use of homo- 
geneous and non-homogeneous coordinates in geometry, when the 
coordinates are real or complex numbers. When geometry is 
developed with the help of these coordinates, results are obtained 
by methods which belong to algebra, the differential calculus, and 
so on. Those results which can be obtained by purely algebraic 
processes (and they include many which are usually obtained by 
the methods of the calculus) make up the subject with which we 
are concerned in this work. 

The operations of algebra we shall study are those of addition, 
subtraction, multiplication, division, and the solution of algebraic 
equations. While ordinary complex numbers are the most familiar 
elements for which these operations are defined, there are more 
general sets of elements for which it is possible to define them. By 
allowing our coordinates to belong to these sets, a more general 
geometry is obtained. We thus arrive at the definition of a more 
general space than that considered in elementary geometry, and 
the study of this space is the purpose of this work. 

In this and succeeding chapters we consider sets of elements for 
which some or all of the algebraic operations cited above are defined, 
and step by step we arrive at a characterisation of the sets of elements 
from which our coordinates may be chosen. These sets are known in 
algebra as fields. In order, however, that the geometry which we 
derive may conform to the general pattern which appears when the 
field is that of the complex numbers, we shall find it desirable to 
impose certain restrictions on the fields considered. These restric- 
tions are not all imposed simultaneously, but in succession; and 
only when we have proceeded under the limitations already adopted 
as far as our subject demands do we impose a new condition. The 
mathematical advantages of such a method are evident. 


HP I 


i 
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I. RINGS AND FIELDS 


1. Groups. Consider a set S of elements, which we denote by 
a,b,c, .... A law by which, given any ordered pair of elements a 
and 6 of 8, possibly not distinct, we can derive a unique element c 
of 8, is called a law of composition for S. A non- vacuous set S of 
elements with a law of composition which satisfies certain con- 
ditions, explained below, is called a group. 

We denote the element resulting from the combination of a and 
b (in the given order) by ab ; if the resulting element of 8 is c, we write 

ab = c. 

ab is a uniquely defined element of S, but may be different from ba. 

The law of composition is said to be associative if, given any three 
elements a, 6, c of S, we have the equation 

(ab) c = a(bc). 

We then write this element as abc. 

The conditions that a set S, with a given law of composition, 
should form a group are that 

(i) the law of composition is associative; 

(ii) given any two elements a, b of S, there exist elements x, y 
such that 

ax = b and ya — b. 

As an example of a group, consider the possible derangements of 
the integers 1, 2, 3. If a, /?, y is any derangement of these integers, 
we denote the operation of replacing 1, 2, 3 by a, /?, y by the symbol 



The successive application of any two of the six possible operations 
is equivalent to some single operation of the set, and hence a law 
of composition for the set of operations is defined. If we denote 
the six operations 

/I 2 3\ /I 2 3\ /I 2 3\ 

\1 2 3/’ \2 3 lj’ \3 1 2/’ 

/I 2 3\ /I 2 3\ /I 2 3\ 

[l 3 2/’ \3 2 1/’ \2 1 3/ 

by a, b, c, d, e, / respectively, and denote the result of performing 
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first the substitution x and then the substitution y by xy, the com- 
plete law of composition is given by the table of double entry: 



a 

b 

c 

d 

e 

f 

a 

a 

b 

c 

d 

e 

f 

b 

b 

c 

a 

e 

f 

d 

c 

c 

a 

b 

f 

d 

e 

d 

d 

f 

e 

a 

c 

b 

e 

e 

d 

f 

b 

a 

c 

f 

f 

e 

d 

c 

b 

a 


where the entry m the row containing x and the column containing 
y is xy. From the table, or directly from the definition, it is at once 
seen that the law of composition is associative. Again, since each 
row contains all six elements, the equation 

px = q 

always has a solution; the corresponding result for the equation 

yp = q 

follows from the fact that each column contains all six elements. 
Hence the six elements, with the law of composition, form a group. 

We notice in this group that bd = e, but db = /. Hence the equa- 
tion xy — yx does not hold for all pairs of elements in the group. 
Such a group is called non-commutative. If, in a given group, the 
equation xy = yx is always true, we say the group is commutative, 
or Abelian. A very simple example of such a group is provided by 
the natural integers (positive, zero and negative), the law of com- 
position being ordinary addition of integers. 

It is often convenient, when dealing with commutative groups, 
to use the symbol of addition for the law of composition, writing 
a + b instead of ab. We then call the group an additive group. It is 
important to remember that this notation is never used for a 
non-commutative group. 

We now obtain certain properties common to all groups. From 
condition (ii) we know that, given any element a, there exist elements 
e and / such that 

ae = a, fa = a. 

Let b be any element of S. Then there exist elements c, d such that 

ac = b, da =a b. 

i-a 
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From condition (i) we have 

be = (da) e = d(ae) = da = b, 
fb = f(ac) = (fa) c - ac = 6. 

In particular, in the first of these equations put 6 = /, and in the 
second put 6 = e. Then / = /e, and fe = e. Hence e =f. If there is 
another element e' with the properties of e, 

e'e = e', and e'e = e, 

and therefore e is unique. We have thus established the existence 
of a unique element e of the group such that 

ae = a = ea 

for every element of the group. This element e is called the unity 
of the group. In the case of an additive group it is usually called 
the zero of the group, and denoted by 0. 

Now consider the equation 

ax = e, 

where a is any element of the group, e being the unity. By (ii) this 
has a solution x. Then 

xax = xe = x, 

and therefore the element f = xa has the property fx = x, for the 
x considered. But, by an argument used above, fb = b for any b in 
the group. In fact, let c be an element satisfying the equation 
xc = 6. Then 

fb = fxc = xc = b. 

It follows, taking b = e, that / = e. Therefore xa = e. If y is any 
element such that 

ay = e = ya, 


then y — ye — yax = ex = x. 

Hence x is uniquely defined by the equations 

ax = e = xa. 

This element is called the inverse of o, and is denoted by a -1 . (In 
the case of an additive group it is called the negative of a, and 
denoted by —a. We then write 6 —a for b + (—«)•) 
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We now show that the equations 

ax = b, ya = b, 

where a, b are any elements of the group, serve to define x and y 
uniquely. For 

x = ex = a~ l ax = o -1 6, 
and y = ye = yaa~ x = 6a. -1 . 

Hence a; and y are determined explicitly. In particular the equation 

a~ x x = e 

has a unique solution. But 

a~ x a = e. 

The solution is therefore x = a. Therefore 

(a -1 ) -1 = a. 

In the case of an additive group this becomes 

— ( — a) = a. 

A non-vacuous subset s of S may, with the law of composition 
assumed for the elements of S, also form a group. This is called a 
subgroup of the given group. The following conditions are evidently 
necessary and sufficient for the elements of s to form a subgroup: 

(i) if s contains elements a, b, it contains ab; 

(ii) if s contains an element a, it also contains a~ x . 

We conclude this section with a brief reference to an evident 
generalisation of the example of a group described above, namely, 
the symmetric group whose elements are the permutations of the 
numbers 1,2, 3, ...,w. The law of composition is defined as in the 
example. An important subgroup of this group is the alternating 
group. To define the alternating group we assume the definition of 
polynomials in the n indeterminates x v ...,x n which will be given 
in § 5. Consider the polynomial 

A = Tl(x t -x k ) (i,k = 1,2, ...,»). 

i<k 

If the suffixes 1,2, ...,n are permuted, it is easily seen that the 
polynomial A is either unchanged (except for the order of the 
factors) or becomes — A. Permutations which leave A invariant are 
called even permutations, the others are called odd permutations. 
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A transposition, that is, a permutation which interchanges two 
suffixes only, is seen to be an odd permutation. Any permutation 
can be regarded as the product of transpositions. Such a decom- 
position of a permutation into transpositions is not unique, but the 
parity of the number of transpositions for a given permutation is 
independent of the method of decomposition. The product of two 
even or two odd permutations is even; the product of an even and 
an odd permutation is an odd permutation. Hence the set of even 
permutations of the symmetric group forms a subgroup, which is 
called the alternating group, and the number of even permutations 
in the symmetric group is equal to the number of odd permu- 
tations. 

2. Rings. A set of elements may have more than one law of 
composition. We shall be particularly concerned with sets having 
two laws of composition, under one of which the set forms a com- 
mutative group. We write this group as an additive group, and 
refer to the corresponding law as the addition law of the set. The 
zero of the group is called the zero of the set. 

The second law of composition is called the multiplication law, 
and the result of combining elements a, b of the set by this law is 
denoted by the product ab. Multiplication need not be commutative, 
but we shall require it to be associative. It is said to be distributive 
over addition if 

a(b + c) = ab + ac, and (b + c)a = ba + ca, 
for all a, b, c in the set. 

A ring, then, is a set of elements with two laws of composition, 
addition and multiplication, with the properties: 

(i) the set is an additive group with respect to addition; 

(ii) multiplication is associative, and distributive over addition. 

The following examples will illustrate the various possibilities 
which may arise in the study of rings. In the first four cases the 
laws of composition for the elements involved are addition and 
multiplication as usually defined: 

I. The set of all complex numbers. 

II. The set of all integers, positive, zero, and negative. 

III. The set of all even integers. 

IV. The set of all integers, reduced modulo the integer m. 
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V . The set of all matrices of q rows and columns whose elements 
are complex numbers. A matrix of this type is the array 


a ll 

<*12 


<*21 

<*22 

• a 2 q 

,<*9l 

<* 9 2 

• &qq, 


where each is a complex number. Addition is defined by the rule 


a 


ii 


a 


12 


a, 


21 


a, 


22 




fli 


a, 


<f2 



An A 12 • Aia'' 

A 21 A 22 • A2a 


'’51 ^a2 • Aaa' 

! a n + All a 12 + Al2 
_ / a 21 + A21 a 22 + A22 


a ia + Ai a N 

<*2a + A2a 


<<*al+A fl i a a 2 + A 8 2 

and multiplication by the rule 



a- 


12 


a 21 a 


22 


<*gl a <?2 



where 


’All Al2 

A 21 A 22 


,A«1 Aa2 • A< 


7 « — 2 a ikPk]- 

fc-i 


a aa A, 


99/ 


7n 7i2 



7 3 i 7a2 



The reader may easily verify that these sets, with the prescribed 
laws of addition and multiplication, form rings. Further study 
reveals certain features common to some of the rings, but not 
necessarily to all. 

(i) In the cases I, II, III and IV the zero of the ring is the number 0. 
In case V the zero is the zero matrix 


In all five cases 


0= /0 0 . 0' 
/ 0 0.0 
I • • • 

\o 0 . 0 , 

a.O = 0 = O.a 


for all elements a of the ring. 
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(ii) In cases I, II and IV let e be the integer 1, and in case V let 

e- /I 0 . 0^ 

0 1.0 


^0 0 . 1 , 

in each of these cases e has the property given by the equations 

ae = a = ea 

for every element a of the ring. When such an element e exists, it 
will be shown to be unique. It is called the unity of the ring. In 
case III there is no element of the ring with this property. 

(iii) In all but the last case multiplication is commutative. In 
case V it is non-commutative, since, if q > 1, 


/ 1 

0 . 

°\ y 

(° 

0 . 

° 

\0 

• • 

0 . 

0 / \ 


ii 

21 


a, 


and 


ai 


a, 


•32 


ra- 


il 
a 2i 


a 12 

a 22 


V«ai a 32 



0 

0 


0 0 



a 


18 1 
0 


(iv) In cases I, II and III, if a and b are two elements of the ring 

such that , „ 

ab = 0, 

then either a = 0 or b = 0. 

This property holds for the ring IV if and only if m is a prime 
number. If m = pq, where neither p nor q is 1, thenp and q are two 
non-zero elements of the ring such that pq = 0. On the other hand, 
if m is prime and ab = 0, so that 

ab = cm, 

it follows, by the unique factorisation properties of ordinary integers, 
that either a or 6 is divisible by m. 

The property we are discussing does not hold in case V if q> 1. 

F0r / 1 A A\ /A /\ A\ Q 


/I 

0 . 

°\ , 

(° 

0 . 

°\ 

0 

0 , 

0 

' 0 

f 

1 . 

0 

\o 

0 , 

: o /' 

lo 

• * 

0 . 

0 / 


and neither factor is the zero of the ring. 
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(v) In case I there is associated with every element a of the ring, 
other than the zero, a unique inverse a -1 such that 

aa -1 = a -1 a = e. 

The rings II and V do not have this property, and it can be shown 
quite simply that the ring IV has the property only when m is a 
prime number. The property is meaningless in case III since the 
ring has not unity. 

(vi) If we write 

a + a = 2a, a + 2a — Za, etc., 

the elements a, 2a, 3 a , ... (a 4= 0) are all distinct, except in case IV, 
when 

ma = 0 

for all elements a in the ring. 

It will be observed that only in (i) did we have a property common 
to the five rings, namely, the zero has the property aO = 0 = 0a. 
We now show that this property holds for all rings, and deduce 
other elementary results true for any ring. 

Let a, 6 be any two elements of a ring R, and let 0 be the zero 
of R. Then 

a + 0 = a, 

and therefore ba — b(a + 0) = ba + bO, 

and also ab — (a + 0) b = ab + Ob. 

From the uniqueness of the zero of a ring it follows that 

60 = 0 = 06, 


these equations holding for any element 6 in R. 
Again, we know that any equation 


has a unique solution 


in R. Now 


a + x = 6 


x — b + (—a) — b—a 


a(b — c) + ac = a(b — c + c) 
= ab. 


Hence 

and similarly 


a(b — c) — ab — ac, 
(b — c)a — ba — ca. 
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Therefore, taking b = 0, we obtain the equations 

a( — c) = —ac, ( — c)a — —ca 
for all a, c in R. Again, 

(-a)(-b)-ab = (-a)(-6) + (-a)6 

-(-«)( — b + b) 

= (-a)O = 0, 

and therefore ( — a) ( — b) = ab. 

We have thus shown that the usual multiplicative properties of 
the minus sign hold in any ring. 

We now define the relationship between two rings known as 
isomorphism. Let R and R* be two rings such that to each element 
a of R there corresponds a unique element a* of R*, and such that 
any element a* of R* arises from exactly one element a of R. Such 
a correspondence is said to be one-to-one. Now suppose, in addition, 
that the correspondence is such that if a, b correspond respectively 
to a*, b*, then a + b and ab correspond respectively to a* + b* and 
to a*b*. The correspondence is then called an isomorphism. 

Isomorphism between rings is a relation of the class known as 
equivalence relations. Consider any set S of elements a, ft, y, . . . , and 
let there be a relation, which we denote by ~ , between the elements 
of S , so that, given any two elements a, /?, we know whether a~/? 
is true or false. If the relation ~ is : 

(i) reflexive, that is, a ~a for all a in S; 

(ii) symmetric, that is, a~/? implies /?~a; 

(iii) transitive, that is, if a ~ 0 and /?~y, then a~y; 

we say it is an equivalence relation. 

An equivalence relation between the elements of a set S divides 
S into subsets, no two of which have any elements in common. If 
a, P are in the same subset, a~ ft. Every element of S lies in one of 
these subsets. 

It is clear that if S is the set of all rings, and if a ~ /? means that 
the ring a is isomorphic with the ring ft, the relation is an equivalence 
relation. We shall often speak of two isomorphic rings as being 
equivalent, implying that if in our discussion we replace one ring 
by the other (making any necessary consequential substitutions) 
nothing in our conclusions is altered. 
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A subset 8 of the elements of a ring R is said to form a subring 
of R if the elements of 8 form a ring under the addition and multi- 
plication laws of R. For this to be so it is necessary and sufficient 
that if a and b are any two elements of S, then a — b and ab belong 
to 8. 

If S is a subring of R, R is said to be an extension of 8. The fol- 
lowing theorem is frequently used: 

Theorem I. If A and B* are two rings, and A is isomorphic with 
a subring B of B*, there exists an extension A* of A which is iso- 
morphic with B*, this isomorphism including that between A and B. 

Let C be a set of elements in a one-to-one correspondence Fwith 
those elements of B* which are not in B, and let A* consist of the 
elements of A and C. We have merely to define addition and 
multiplication in A* so that A* becomes a ring with the required 
properties. Let a v a 2 , ... be elements of A*. To each a { there corre- 
sponds a unique element b i in B*. If a t is in A, b { is the element of 
B corresponding to a i in the isomorphism between A and B. If 
a i is in C, b i is that element of the set of elements of B* not in B 
which corresponds to a t in the correspondence r. Conversely, to 
any element b t of B* there corresponds a unique element eq of A*. 
Now, if Oj, a 2 are any two elements of A*, we define a 1 + a 2 and a x a 2 
as follows. To a 1 and a 2 there correspond b x and b 2 in B*. Let 

bi + b 2 = b 2 , b- l b 2 = b 4 , 

and let a 8 and a 4 be the elements of A* which correspond to 6 3 
and fc 4 . We then define 

cq -f- a 2 — flg, a^a 2 — cq. 

Clearly A* is a ring isomorphic with B*. Moreover, the construction 
shows that A is a subring of A*. Hence A* has all the required 
properties. 

We conclude this section by describing a particular type of sub- 
ring which we shall meet in later chapters. Let R be any ring, I a 
subset of R with the following properties: 

(i) if a, b are in I, then a— b is in /; 

(ii) if a is in I, r in R, then ra is in I. 

We call I a left-hand ideal in R\ a right-hand ideal is defined by 
reading ar instead of ra in (ii). The two ideals coincide when 
multiplication in R is commutative. 
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To show that if I is not vacuous it is a subring of R we remark 
that if r is in I conditions (i) and (ii) coincide with the conditions 
stated above for a subring. On the other hand, not all subrings are 
ideals, since the condition (ii) need not hold. 

R itself is an ideal in R, and is called the unit ideal. The subset 
of R which consists only of the zero of R is also an ideal. These two 
ideals are usually called the improper ideals of R. A ring containing 
a proper ideal is the ring of integers (Example II, above). If m is 
any integer greater than 1, the set I of all integers of the form ± ma, 
where a is an integer, clearly forms a proper ideal in this ring. 

The existence of an ideal I in R enables us to define an equi- 
valence relation in R. If a, b are two elements of R, we write 

a = b (I) 

if a — b is in I. The reader may easily verify that this is an equi- 
valence relation. It is a type of equivalence which will often appear 
subsequently. 

3. Classification of rings. Rings which are subject to no other 
conditions than those imposed by the definition given in §2 are 
too general to be of much interest in algebraic geometry. We limit 
the rings with which we deal by imposing fresh conditions which 
are, in fact, equivalent to requiring the rings to possess one or more 
of the properties noted in the remarks (ii), ..., (vi) on the examples 
of rings given in § 2. Different restrictions lead to rings with quite 
different properties. We now consider some of the more important 
types of rings. 

We saw that a ring R may or may not possess an element e with 
the property 

r r j ea = a = ae 

for every element a in R. We now show that R cannot have two 
distinct elements e, / with this property. For, if 

ea = a, bf = b, 

for all elements a, b in R, we may take a = /, b = e, and obtain 

the equations » _ , _ 

j — ej — e. 

If this unique element e exists in R, it is called the unity of R. 
All the rings we shall consider will be rings with unity, unless the 
contrary is stated explicitly. For the present we continue to denote 
the unity by e. 
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If a is any element of a ring R with unity, there may or there may 
not exist an element a -1 of R such that 

aa~ x = e = a~ x a. 

There cannot be two such inverse elements, for if 

ax = e, 

then a~ x ax = o -1 e, 

that is, x = a~ x . 

When such an inverse element a~ x exists, a is said to be regular, or 
to be a unit of R. Clearly e and — e are units of R. On the other 
hand, the zero 0 is not a unit, since for every element a of I? 

aO = 0. 

Again, we have seen [§ 2, (iv)] that there are rings R in which 
there exist pairs of elements a, b, neither of which is zero, such that 

ab = 0. 

We call a a left-hand divisor of zero, b a right-hand divisor of zero. 
The divisor of zero a cannot be regular. For, if a has an inverse a -1 , 

b = a~ x ab = a -1 0 = 0, 

contradicting our assumption that b is not zero. Similarly, b cannot 
be regular. 

The two types of ring with which we shall be particularly con- 
cerned are the integral domain and the field. An integral domain is 
defined as a ring with unity, in which multiplication is commutative, 
and which has no divisors of zero. Examples I, II, and IV (in the 
case m prime) of § 2 are integral domains. 

A field is defined as a ring with unity in which every non-zero 
element is regular. The ring of all complex numbers is clearly a 
field. By the result proved above a field cannot contain any divisors 
of zero. A commutative field — that is, a field in which multiplication 
is commutative — is an integral domain. 

We shall eventually be able to confine our attention to com- 
mutative rings, but it is convenient to obtain certain results which 
are valid for non-commutative rings, as these will be used in an 
analysis of the postulates on which projective geometry is usually 
based. Therefore, except when we say so explicitly, we shall not 
assume our rings to be commutative. 
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In a ring R consider any element a. Write 

a+a = 2a, a+2a = 3a, ..., 

noting that na is not a product, but the sum of n elements each 
equal to a. We have seen by an example that there exists a ring R 
for which we can find an integer m such that 

via = 0 

for all elements a in R. 

Now, let R be a ring with unity e, and suppose that for some 
positive integer n, _ n 

Let m be the smallest positive integer satisfying this condition. 
Then, first of all, if a is any element of R, 

ma = a + a + ... +a 
= ea + ea + ... + ea 
= (e + e + ...+e)a 
= (me) a 
= 0 . 


If m is not prime, and m = pq, then 

pe - {= 0, qe-- 1=0, 

but peqe = pqee 

= me = 0, 

and R has divisors of zero. The number m is called the characteristic 
of the ring R. We have proved that the characteristic (if there is 
one) of an integral domain or a field is a prime number. 

If, for all non-zero integers n, 

ne- (=0, 

the ring is without characteristic. (It is then often said to be of 
characteristic zero, but this terminology is not strictly correct.) 

If a ring R with unity has characteristic m, the subset 

0, e, 2e, ..., (m— l)e 

forms a subring. For 

ae — be = (a — b)e 
= ce, 

where c = a — b ( modulo m), 

and (oe) (be) = de, 

where d = ab (modulo m). 
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Thus R contains a subring isomorphic with the ring of integers 
reduced modulo m. By Th. I, § 2, there is an extension of this ring 
isomorphic with R. 

Similarly, if R is without characteristic, it is isomorphic with an 
extension of the ring of natural integers. It follows that a ring 
without characteristic contains an infinite number of elements. 

In each case we identify R with its isomorph, and so in future we 
shall write the unity of the integral domain or field we are con- 
sidering as 1. 

Later on we shall confine ourselves to integral domains and fields 
without characteristic, but at present we do not make this restriction. 

4. The quotient field of an integral domain. It is a familiar 
result that the integral domain of the natural integers is embedded 
in a field, namely, the field of rational numbers. This is a special case 
of a general theorem which we shall frequently use. 

Theorem I. Given an integral domain I, there exists a commuta- 
tive field K which, regarded as a ring, contains I as a subring. 

The construction which we give for K gives the minimum field 
having the required property. This minimum field is defined 
uniquely, to within isomorphism. 

The elements of I are denoted by a, b, c, . . ., the zero and unity by 
0 and 1. Consider the set of all ordered pairs (a, b) in which 6=t=0, 
and the relation denoted by ~ between pairs where 

(a,b)~{c,d) 

stands for the equation 

ad — be = 0. 

Since multiplication is commutative, it follows at once that this 
relation is reflexive and symmetric. It is also transitive. For, if 

(a, b) ~ (c, d) and (c, d) (e,/). 
ad = be and cf = de. 

Therefore adf = bef = bde, 

and so , d{af— be) = 0. 

From our definition of an ordered pair, d is not zero. Since an in- 
tegral domain contains no divisors of zero it follows that 

af = be, 

(a,b)~(e,f). 


that is, 
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The relation is therefore an equivalence relation, and divides all 
pairs under consideration into classes. The class to which (a, b) 
belongs will be denoted indifferently by [a, 6] or [c, d], where 

(a,b)~ (c,d). 

We define the sum of a pair (a, b) and a pair (c, d) by the equation 
(a, b) + (c, d) = (ad + be, bd). 

By hypothesis, b and d are not zero, and therefore bd =f= 0. The pair 
denoted as the sum is therefore admissible. It is evident that 
addition of pairs is commutative. Now let 

(a, b) ~ (o', b'), (c,d)~(c',d’). 

Then (a', 6') + (c', d') = (< a'd ' + b'c', b'd') ~(ad + be, bd), 
since (ad + be) b'd' — (a'd' + b'c') bd 

= (ab' — a'b) dd' + (cd' — c'd) bb' 

= 0. 


Hence we may define the addition of classes by the equation 
[a, 6] + [c, d] = [ad + be, bd ]. 

Since 

([a, 6] + [c, d]) + [e, /] = [ad/+ bef+bde, bdf ] 

= [a,b] + ([e,d] + [e,f]), 

addition is associative. Also, the equation 

[a, b] + x= [c, d] 


has the solution 


x = [be — ad, bd]. 


Hence the classes form a commutative group under addition. The 
zero of the additive group is [0, 1] = [0,a], where a is any non-zero 
element of I. 

We now define multiplication of pairs by the equation 
(a,b)(c,d) = (ac,bd). 

Multiplication is clearly commutative. Also, if 

(a,b)~(a',b'), (c,d)~(c',d'), 

then (a'c' ,b'd')~(ac,bd), 


a'c'bd — ach'd' 

= (a'b — ab') c'd + (c'd —cd')ab' 

- 0 . 


since 
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Thus we can define multiplication of classes by the equation 

[a, 6] [c, d] = [ac, bd ]. 

Multiplication can immediately be proved to be associative. Again, 
[a, b] ([c, d] + [ e , /]) = [a, b] [cf+ de, df ] 

= [acf+ade, bdf] 

= [abcf + abde, b 2 df ] 

= [ac, bd\ + [ae, 6/] 

= [a,b][c,d] + [a,b][e,f]. 

Hence multiplication is distributive over addition. The set of 
classes of equivalent pairs, with addition and multiplication defined 
as above, therefore forms a commutative ring K*. If a is any 
non-zero element of I, 

{a, a) -(1,1), 

and so [a,a] = [l, 1], 

Since [a, 6] [1,1] = [a, b], 

[1, 1] is the unity of K*. Finally, if [a, 6] is any non-zero element 
of K*, both a and b are non-zero. Therefore [ b , a] is in K*. But 

[b, a] [a, b] = [ab,ab] = [1,1]. 

Therefore every non-zero element of K* has an inverse. Thus K* 
is a commutative field. 

Now consider the elements of K* which can be written as [a, 1], 
and let them form the subset S. Since 


[a, 1] — [6, 1] = [a — b, 1], 
and [a, 1] [ b , 1] = [ab, 1], 

S forms a subring of K* isomorphic with I. By § 2, Th. I, there 
is an extension K of I which is isomorphic with K*, and which is 
therefore a field. The element of K corresponding to [a, b] of K* is 
usually written a/b. If 

a _ c 

b~d’ 

then ad = be. 

We also have a 


HP I 


2 
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The field K is the one required. It is usually called the quotient field 
of I. To complete the proof of our theorem we now show that any 
commutative field K' which contains I as a subring contains a 
subfield isomorphic with K. 

We first observe that, by the definition of a field, the non-zero 
elements of K' form a group under multiplication. Hence the 
equation 

bx = a (64=0), 

where a, b are in IC, has a unique solution in K' . We now show 
that the equations 

bx = a, dx = c, 

where a, b, c, d are in I, and b and d are not zero, have the same 
solution in K' if and only if 

ad = be. 


First, if this condition is satisfied, and a; is a solution of the first 
equation, we have 

bdx = ad = be. 

Therefore b(dx — c) = 0, 

that is, dx = c. 

Secondly, if the two equations have the same solution x, then 

ad = bdx = be. 


Thus to every element x of K' which satisfies an equation 

bx — a, 

where a, b are in I, and b 4= 0, there corresponds a unique element 
a/6 of K, and conversely. Moreover, if x, y correspond to a/6, c/d 
respectively, then 

bx = a, dy = c. 

and therefore db(x +y) = ad+bc, 

and bdxy = ab. 

Hence x+y and xy correspond to | + J and ~ respectively. The 
field K' therefore contains a subfield isomorphic with K. 
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From this theorem we deduce that K is the smallest field which 
oontains I as a subring. 

Finally, we note that if I is of characteristic p, the sum to p terms 

1 + 1 + ... + 1 — 0 . 

Hence [1,1] + [1,1]+... + [1,1] = 0, 

and it follows that K is of characteristic p. On the other hand, if 
I is without characteristic, 

[ 1 , 1 ] + [ 1 , 1 ] + ... + [ 1 , 1 ] = [1 + 1 + ... + 1 , 1 ] * [ 0 , 1 ], 

and hence K is without characteristic. 

5. Polynomial rings. We now discuss a method of extending 
a ring by the adjunction of an element x, called an indeterminate. 
The only case of any importance which we shall meet in this book 
is that in which the ring is an integral domain, and it will make for 
simplicity if we assume throughout this section that the basic ring 
from which we start is an integral domain I. 

Let us consider the ordered sets 

(®o> a v • ■ ■ > a r) 

which consist of a finite number of elements a t of I. Two sets 

K> *•'» ®r)> (^o> • • •> bg) 

are said to be equivalent if 

a i = 6 i (i = 1, 2, ..., min (r, s)), 

a t = 0 (i — s+ 1, . ..,r) when r> a, 

bi = 0 (i = r+1, ...,«) when s >r. 

This relation is evidently reflexive, symmetric and transitive, and 
is therefore an equivalence relation. We define the addition of two 
sets by the equation 

(Uq> ®i» •••> ®r) "h (b 0 , b\, •••> bg) = (Cq, Cj, ..., c ( ), 

where t = max (r, s), and c { = a { + b { , defining a t — 0 for i>r and 
b t = 0 for i > 8. It is easily seen that if either of the two sets is 
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replaced by an equivalent set, the sum is replaced by an equivalent 
set. Multiplication of two sets is defined by the equation 

(Oq, . ,.,( ir) {bg, b^, ...,bg) = (df), d-y, . . ., d r + a ), 

where d { = a i b 0 +a i _ 1 b 1 + ... +a 0 b { (i = 0, ...,r+a), 

and, as above, we define a t = 0 for i>r, and 6^ = 0 for i>s. Again, 
the replacement of either set by an equivalent set converts the 
product set into an equivalent set. If we now consider the classes of 
ordered sets defined by the equivalence relation, we see that they 
form an additive group with respect to addition, the zero of the 
group being all sets equivalent to the set (0); also, multiplication is 
associative, and distributive over addition. Hence our classes of 
equivalent sets, with the two laws of composition we have defined, 
form a ring R, and it is easily seen that (1) is the unity of R. This 
ring B contains a subring which is isomorphic to I. The elements 
of this subring are (a). For 

(a) + (b) = (a +6), 

and (a) ( b ) = (ab). 

Hence an isomorphism is set up if (a) is mapped on the element 
a of I. 

Let us now consider the set (0, 1). We see that 

( 0 , 1 ) 2 = ( 0 , 1 ) ( 0 , 1 ) = ( 0 , 0 , 1 ), 

( 0 , 1 ) 3 = ( 0 , 1)2 ( 0 , 1 ) = ( 0 , 0 , 0 , 1 ), 


(0, l) n = (0, l)” -1 (0, 1) = (0, 0, 0, ..., 1), 
where in the last set written the unity is preceded by n zeros. Since 
(«)(0, l) n = (0,0,0, ...,a), 
we may express the set (a 0 ,a v as follows: 

(«„,«!, ...,a r ) = (a 0 )+K) (o, i) + (<*») (o,i) 2 + ... + (« r )(o, i y. 

Now let us denote by /[a;] the ring which is an extension of I and 
which is isomorphic with R [§ 2, Th. I], x being the element of the 
extension which is mapped on the set (0, 1 ) of R. Then, from the 
above reasoning, the set (a Q , a 1 , ...,a r ) of R is mapped on the 
element a 0 + a t x + a 2 x 2 + . . . + a r x r of I[x] . These elements are called 
polynomials, with coefficients a 0 , %,..., a r . If a r 4= 0, the polynomial 
a 0 + a 1 x+ ... + a r xf is said to be of degree r. 
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We note that the zero (0) of R is mapped on the zero of I, and 
therefore the equation 

o 0 + a 1 a; + ... + a r af = 0 (1) 

is satisfied if and only if 

®o = a i — ... = a r = 0. 

Because of this property of the element x, it is said to be an in- 
determinate over I. 

In the representation of a polynomial we usually omit any term 
in which a i = 0, whether i exceeds or is less than the degree of 
the polynomial. In this sense a n x n is a polynomial whose complete 
representation is 0 + 0* + . . . + Oaf 1-1 + a n x n . Furthermore, 

a 0 + a 1 a; 1 + ... +a r a f 

can be regarded as the sum of the r + 1 polynomials 

a 0 , a x x, ..., a r x r . 

Since also a i x i is the product of a t and x i , and since for the poly- 
nomials x, x 2 , ... we have the multiplication law 

x p xfl = X p+Q , 

the ring I\pi\ is obtained by introducing a new element x among the 
elements of I which can be multiplied commutatively by any 
element of I . We say that /[a;] is obtained from I by adjoining the 
indeterminate x to I . I [a;] is also called the ring of polynomials in 
x over I. 

These results apply when I is any commutative ring with unity. 
We now proceed to draw conclusions concerning the ring I[x\ from 
the fact that I is an integral domain. 

Theorem I. If I is an integral domain, so is I[x\. 

We have already seen that I[x] is a commutative ring with 
unity. We must now show that I\x\ has no divisors of zero. If 

f =a 0 + a x x + ...+a m x m , 
and g = b 0 + b 1 x+ ...+b n x n 

are two non-zero polynomials, we may assume a m + 0, 6 n 4= 0. Then 
fg = c 0 + c 1 x+...+ c m+n x m + n , 

where c m+n — a m b n ^0, since I has no divisors of zero. Hence 
/gr+ 0. In other words, if fg = 0, and /= j=0, we must have <7 = 0. 
Therefore /[a;] has no divisors of zero. 
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The process of adjoining an indeterminate to an integral domain 
to obtain another, more extensive, integral domain may be repeated. 
If J = I[xf\, J[z 2 ] is an integral domain. Its elements are poly- 
nomials , , , n 

a 0 + a 1 x 2 + ...+a n x%, 


where a* = a i0 + a n x 1 +...+ a im x f 


is in and m is the maximum of the degrees of a 0 , ..., a n . The 
elements a {} lie in I. Hence, the elements of J[x 2 ] can be written 
in the form „ m 


S 

<»<H=o 


in which each term a i} x\x{ is the commutative product of a i} , x\, x{. 
If J' is the integral domain I[x 2 ], the elements of J'\xf\ are of the 

form m n 

S S 

Oj-O 

and the integral domains J[x 2 ] and J'[x{\ are easily proved to be 
equivalent. We denote this integral domain by I[x v a; 2 ]- 

Proceeding in this way, we can adjoin indeterminates x lt x 2 , . . . , x T 
in turn to I. The resulting ring I[x v ...,x r ] is independent of the 
order in which the indeterminates are adjoined. For the elements 
of this ring are of the form 

th iw . 

S — S a*,...**}* — 

<,-0 <,-0 


the term written explicitly being the commutative product of 
a ii ir , x*f , ..., xjr. Also, I[x v ..., a: r ] is an integral domain. We have 
already proved this result when one indeterminate is adjoined to I. 
We assume it is true when r — 1 indeterminates are adjoined. The 
result is assumed true for J = I[x v ..., x r _^\, and by Th. I is there- 
fore true for J[x r ], But J[x r ] = I[x v ...,x r ]. Hence we have 

Theorem II. If I is an integral domain, so is I[x v ...,xf\. 

We conclude this section by introducing an important operation 
on polynomials in I[x v ...,x r ] known as specialisation. Let 


n, n, 

/- E ... E H...ir X l‘ — X r 
0 it - 0 

be any polynomial in our integral domain. We select any number 
j (0^j<r) of the indeterminates x x , ...,x T , say, for example, 
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X v 

*1. 


x } . Let a v ...yCt] be any elements of I. Then if in/ we replace 
x } by a v . . . , otj respectively, / becomes 


»i+i 

/*= s • 

i }+ 1 “0 


n r 

. S 6 

U - o 


Vn...<r%+1 •••“¥> 


if+i 


X? 


where , *» , 

V. ... = S • • • S % <, a i l - 

it =0 ij = 0 

is in J. (a f is, of course, the product of i factors each equal to a.) 
Then / is said to specialise to a polynomial /* in I[x }+1 , ...,a; r ]. 
We use the notation 

/-*/*, .... Xj-^cCj 

to describe the specialisation. 

By using the fact that equations such as 

f+9 = h, fg = k 

express relations between the coefficients of /, g, h and k which do 
not involve the indeterminates x x , ...,x r , we see at once that if, 
for any specialisation of the indeterminates, 

/->/*, g-+g*, h^h*, k^k*, 

then f* + g* = h*, f*g* = k*. 

The converse of this result is not, of course, true. 


Note. While we do not have to consider polynomials whose 
coefficients are in a non-commutative ring, it should be pointed out 
that a theory of such polynomials exists. The only result of this 
theory which we shall use is the following. Let 

f = a 0 x m +a 1 x m ~ 1 + ...+a m , 

where a t is in a non-commutative ring K, x is indeterminate and is 
assumed to commute with every element of K. Then if a is any 
element of K , there exist polynomials 

p = box™- 1 + . . . + 6 m _ x , q = CoX™- 1 + . . . + c m _ v 
and elements b m , c m of K such that 

f=(x-a) {b 0 x m ~ l + . . . + b m _ x ) + b m 
= (Co*” 1 " 1 + .. . + c OT _ x ) (x—a) + c m . 

Indeed, we need only take 

b t = a^o + a* -1 ^ + . . . + a u \ 
c t — a 0 a i +a x a i - 1 + ... +a i \ 


(i = 1, ... ,m ). 
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6. The division algorithm. In this section /denotes an integral 
domain, K a commutative field, and I[x], K[pc\ the rings obtained 
by adjoining the indeterminate x to I, K respectively. 

Let f = a 0 +a 1 x+ ...+a m x m , 

g — b 0 +b 1 x+ ... + b n x n {n^m) 

be two polynomials in I[x ] of degrees m, n respectively, and let us 
suppose, in the first place, that b n = 1. Then, clearly, 

/i =f~a m x m - n g 

is a polynomial in /[a;] of degree m 1 <m. If n < m lt and if 
f 1 = a' 0 +a' 1 x+...+a' mt x m ', 
then h=fi-a' m x m '- n g 

= f~ {a m x m - n + a' mi x m '~ n ) g 

is a polynomial of degree m 2 <m 1 . Proceeding in this way we arrive, 
in all cases, at a polynomial q of degree m — n such that 

r=f-qg 

is a polynomial of degree less than n, the degree of g. 

This result can be generalised to include the case in which b n is 
a regular element of I. For, if b^ 1 b n — 1, 

g' = K x g 

is a polynomial of degree n in which the term in x n has coefficient 
unity. Then there exists a polynomial q' such that 

r=f-q'g' =f-{q'bn 1 )g 

= f-qg 

is of degree less than n. 

We now show that the polynomials q, r obtained in this way are 
unique. For if q' , r' are any two polynomials such that r' =f—q'g 
is of degree less than n, we have the equation 

r-r' = ( q'-q)g . 

But the degree of r— r' is less than n, and the degree of (q' —q)g is 
at least n, unless q’ = q. Hence, unless q' = q, we have a polynomial 
of degree less than n equal to a polynomial of degree at least n. 
This is impossible, and therefore q = q\ and consequently r = r'. 
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The process given above for determining q and r is called the 
division algorithm. 

A special case of the result just proved is the Remainder Theorem. 
If c is any element of I, take g = x — c. Then there exists a poly- 
nomial q such that , , . 

J-q{x-c)-r 

is in I. This equation remains true when we make the specialisation 

a;->c. Hence ,, . 

r =/(c), 


where /(c) is the result of replacing x by c in/. 

Returning to the division algorithm, it may happen that r = 0. 
We then have three non-zero polynomials/, g, q in I\x\ such that 


f = M- 

g and q are said to be divisors of/, and / is a multiple of g and of q. 
If the degrees of both g and q are greater than zero, both g and q 
are said to be proper divisors of/. 

We now suppose that I is a field K. Since every non-zero element 
of A is regular, the division algorithm can be applied to any pair of 
polynomials /„, / x of K[x]. By repeated applications of the division 
algorithm we shall construct a sequence of polynomials / 2 , / 3 , .... 
We shall denote the degree oif i by m t , and suppose that m Q ^m 1 . Our 
purpose is to obtain information on the common divisors of /„ a,nd/ 1 . 
Applying the algorithm to / 0 , we get the equation 

fo = hfi+U 

where m 2 < m v If / 2 is not zero we apply the algorithm to / x , / 2 , 
obtaining the equation 

fi = 9 2 A + A. 


and proceed thus until we have 

f 8—1 = q.Js "b f 8+ 1> 

where f s+1 is zero. We reach this stage after at most m x + 1 applica- 
tions of the algorithm. 

Now, let g be a divisor of/ 0 and of/ x . Then, if 

fo — ho9> 
fi = 

fi — h^g, 

h% = ho q^h j. 


and 

we have 
where 
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Hence / 3 = h a g, 

where ~ ^ 2 ^ 2 ^ 

and so on. Finally . , 

J fs = h9, 

where h 8 = h s _ 2 - q 8 _ r h„_ v 

Thus any common divisor of/ 0 and/ x is a divisor of the set of poly- 
nomials / 2 , Conversely, let g be a divisor of/ 8 . Then since 

fs = h s9, 

we have f 8 _ x = qj 8 = q 8 h 8 g = h 8 _ x g, say, 

fs- 2 = (9s-iK-i + K)9 = K-z9, say, 
and so on, until finally we obtain 

fi = K 9, 

and /„ = h 0 g. 

Thus any divisor of/ g is a common divisor of/ 0 and/,. 

Since any common divisor of/„ and is a divisor of/ s , the degree 
of such a divisor g cannot exceed m 8 , the degree of f 8 . On the other 
hand, f a is itself a divisor of f 8 , and therefore of /„ and f v These 
polynomials therefore have a common divisor of degree m a , and 
have no common divisor of higher degree. We call f s , or af 8 , where a 
is any non-zero element of K, the highest common divisor of /„ and f x . 

The division algorithm enables us to express f 8 in terms of/ 0 , 
as follows. We have the equations 

h =fo-9ifv 
fz ~ fi~ ( hfz 

= -? 2 /o + (i+?x q%)fu 
and by a simple induction we find that 

fs = afo + bfi, 

where a, b are polynomials in K[x\. 

If f 0 ,fx have no divisors of degree greater than zero,/,,,/! are said 
to be relatively prime elements of K[x\. Then m s = 0, and f s is a 
non-zero element of K. Let its inverse be g. Then 

1 =fs9 = a gfo + bgfi, 
that is, 1 = Af 0 + Bf v 

where A, B are polynomials in K[x\. 
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A non-zero element of K[ x\ which has no proper divisors is called 
a prime element of K[x], or an irreducible polynomial. Our work 
above now enables us to prove 

Theorem I. Iff is an irreducible polynomial in K\x\, and g, h are 
polynomials in K\x] such that f is not a divisor of g, but f is a divisor 
of gh, then f must be a divisor of h. 

Since / is not a divisor of g, the degree of any common divisor of 
these two polynomials is less than the degree of/. But/ is irreducible. 
Therefore the only divisors of/ have degree zero. Hence/ and g are 
relatively prime, and so we can find polynomials a, b such that 

I = a f+bg. 

Therefore h = afh + bgh. 

But /divides gh. Therefore, for some polynomial k, 

gh = fk. 

Hence h = ( ah + bk)f ’, 

showing that / is a divisor of h. 

We note here that the term factor is often used in the same sense 
as divisor , so that we may refer to the highest common factor of two 
polynomials instead of to their highest common divisor. 

We conclude this section with a theorem of frequent application. 
Let K* be an extension of the field K, and suppose that /, g are 
polynomials in K\x\, and therefore in K*[x\. If/, g are regarded 
as polynomials in K*\x\, the process of the division algorithm, when 
applied to / and g, is carried out entirely in the subring K\x\. Hence 
the process of finding the highest common factor of / and g in 
7f*[a;] can be carried out entirely in K[x\. Thus the highest common 
factor of/ and g in K[x~\ is the highest common factor in K*[x], and, 
conversely, the highest common factor of / and g in K*[x~\, when 
multiplied by a suitable element of K*, is in K\x\, and is the highest 
common factor of f and g in this ring. Hence, in particular, we obtain 

Theorem II. If K* is an extension field of the field K, and iff, g 
are two polynomials in K\x\ which are relatively prime, then f, g are 
still relatively prime when regarded as polynomials in K*\x\. 

7. Factorisation in an integral domain. We have defined a 
unit or regular element of an integral domain 7 to be an element a 
which has an inverse a -1 in I, so that 

aa~ l = 1. 

We saw that the inverse is necessarily unique. 
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Clearly, 1 and — 1 are always units of I. There exist integral 
domains, such as the ring of natural integers, with no other units. 
If, however, the integral domain is a field, every non-zero element 
m it is a unit. We shall make use of the following results in the 
sequel: 

Theorem I. The product of any finite number of units is a unit. 
If the product of any finite number of elements of I is a unit, each 
element in the product is a unit. 

Let e v ...,e r be units, ef x , . . . , e? 1 their respective inverses. Then 
e 1 ...e r ejf 1 ...e~ 1 = e x ef 1 e 2 e^" 1 ... e r e7 1 = 1. 

Hence e x . . . e r is a unit. 

If a 1 ...a r = e 

is a unit, then a x ... a r e _1 = 1. 

That is, a i {a 1 ... a i _ 1 e -1 a f+1 ... a r ) = 1. 


Therefore a i is a unit. 

We now define the term factor. If any non-zero element a of an 
integral domain I can be written as 


a = be, 


( 1 ) 


where b, c are in I, these elements are said to be factors of a, and a 
is said to be a multiple of b and c. When a is written in the form (1) 
it is said to be decomposed into the factors b and c. If one of the 
factors is a unit, the decomposition is said to be trivial. Since every 
I has at least one unit e v every element of I has at least one trivial 
decomposition a = e^e^a). 

An element of I which is not a unit may or may not have a non- 
trivial decomposition. If it has only trivial decompositions it is 
called a prime dement of I. (If I is the polynomial ring K[x\, the 
prime elements are the irreducible polynomials of degree greater 
than zero.) We prove that if any product a 1 ...a r of elements in I is 
a prime, then one of the factors is a prime, and the rest are units. 

Forif b = a 1 ...a r , 

and each a i is a unit, so is 6 [Th. I]. This is contrary to the definition 
of a prime. Hence one a t at least, suppose it is a lt cannot be a unit. 
Then since b is prime, b = a ^ a2 

can only be a trivial decomposition. But o x is not a unit. It follows 
that a 2 ...a r is a unit, and therefore, by Th. I, that a 2 ,a 3 , ...,a r are 
all units. 
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An element a of I is said to be completely factorised if it is expressed 

as the finite product 

r a = a 1 ...a r , 

where each is either a unit or a prime. By Th. I we can combine all 
the factors which are units into a single one. We then write this 
first, using a Greek letter to denote it, so that 


CL — CC&1 • • • GSy * 

implies that a is a unit, a v ...,a r are primes. In particular cases, of 
course, a may be unity. 

Two complete factorisations of the same element 


a = aa 1 ...a r , a = fib 1 ...b a 

are said to be equivalent if r = s, and for a suitable reordering of 
&i = e*a* (i = 1, ...,r), 


the factors 


where e v ...,e r are units. We now give the following definition: 

If an integral domain is such that every non-zero element can 
be completely factorised, and all complete factorisations of a given 
element are equivalent, it is called a unique factorisation domain. 
We shall abbreviate this to u.f.d. 

The ring of the ordinary integers is a well-known example of a 
u.f.d. But we now give an example of a ring, in which not all 
elements are units, which is not a u.f.d. Consider the set S of complex 
numbers which can be written in the form 


a — — 3, 

where a and /? are integers. It is immediately verified that this set 
is closed under the operations of addition and multiplication, and 
it is easily shown that it forms an integral domain. This integral 
domain is, of course, a subring of the ring of complex numbers. 
We define the norm N{a) of a by the rule 

N(a) = a 2 + 3/? 2 . 

N(a) is always a non-negative integer. The following properties are 
rapidly verifiable: 

(i) N(a) = 0 if and only if a = 0; 

N(a) =1 if and only if a = ± 1 ; 

N(a) = |=2 for any a in S; 

N(a) = 3 if and only if a = ± *J— 3; 

N(a) = 4 if and only if a = + 2, or a = + 1 ± <J— 3. 
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(ii) N(a)N(b) = N(ab). 

Since N(l) ~ 1, we deduce that if ab = 1, 

N(a)N(b) = 1. 

Hence N(a) = N(b) = 1, 

that is, by (i), a = ± 1, b = ± 1. 


Therefore the only units of the ring are + 1 and — 1. 
Again, if 


a = a x 




is a factorisation of a in which no factor is a unit, it follows from the 

fact that -mj. . . lT , . 

N{a) = N(a x )..../V(a r ) 

that r cannot exceed the number of prime factors of N(a). It 
follows from this that a can be completely factorised. 

(iii) The numbers + 2, ± 1 ± 3 are all primes. For if any one 

of them could be written as be, where neither b nor c is a unit, we 

should have , T/ , . , T/ . 

N(b)N(c) = 4, 


while N(b)>l and N(c)> 1, 


Therefore N(b) = N(c) = 2, 


and we saw in (i) that there are no elements in the ring with norm 
equal to 2. The elements in question are therefore all primes. Now, 
since the only units are + 1 and — 1, we cannot have the equation 

1 + — 3 = e2, 

where e is a unit. Therefore the factorisations 

(2)(2) = 4 = (1 + V— 3)(1— V— 3) 

are not equivalent factorisations of 4. The ring 8 is therefore not 
a u.f.d. 

We now return to the integral domain I, and assume that it is 
a u.f.d. We prove that any two elements a, b of / have a unique 
highest common factor. First of all let us call two primes p, q 
equivalent if there exists a unit e such that 

V = eg. 

Now let us examine the consequences of assuming that an element c 
is a factor of an element d, so that 

d = ce. 
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c = yc x . . . c r , 
d — — Sd-^ * • • dg f 


6 — 66 ^ • • • 6 ^ 


be complete factorisations of c, <2, e in which, as we agreed, y, 8, e 

denote units, all the remaining factors being primes. Then from 

the equation „ 7 , 

^ 6d 1 ...dg = yec 1 ...c r e l ...e t 

we deduce that r + t = s, and that the primes c lt . . . , c r are equivalent 

to r distinct primes of the set d ly ...,d a . 

Now let l /ji i. 

a = oui l ...a r , b = pb 1 ...b a 

be the complete factorisations of two elements a, 6 of I, and suppose 
that, by suitably choosing the arrangement of the factors, and the 
unit multipliers of the primes, we can say that 

but a } and b k are not equivalent for any j and k greater than t. By 

what has been proved above, any factor of a is the product of a 

unit and a certain number of the a it and similarly, any factor of 6 

is the product of a unit and a certain number of the 6 { . From the 

fact that I is a u.f.d. it follows that if d is a factor of both a and b, 

d can be written , * 

d 

where 1 < < i 2 . . . < »/ < t . Hence d is a factor of 

D = a x ... a t , 

which is itself a factor of a and b. This element D is called the 
highest common factor of a and b, and is uniquely defined, save for 
a unit factor. We now prove 


Theorem II. If a x a n are any n dements of I, there is an element 

d, determined to within a unit factor, which is a factor of each a if and 
is such that any factor common to all the a i is a factor of d. 

We shall call d the highest common factor of a 1 ,...,a n . The 
theorem has already been proved when n = 2. Let us suppose that 
it is true for n— 1 elements of I, a v ...,a n _ v and let d n be their 
highest common factor. Let d be the highest common factor of 
d n and a n . Then d is a factor of a n , and of d n , which is itself a factor 
of a v ...,a n _ v Hence d is a factor of a lt ...,a n _ v a n . Conversely, 
if e is any factor of a v ...,a n _ v a n , it is a factor of d n and a n> and 
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hence of d. It only remains to prove that d is unique. Now, if d' 
is another element of I with the properties of d, d' is a factor of 
a,, ...,a n , and therefore of d. Therefore 

d = ad’, 

where a is in I. Similarly — bd 

where 6 is in I, and hence , ,, 

d = aod, 

giving ab = 1, 

so that a and b are units. 

We conclude this section with the important 

Theorem III. If K is any commutative field, K[x\ is a u.f.d. 

We first prove that any polynomial in K[x\ of degree n > 0 can 
be written as the product of a finite number of prime factors. We 
proceed by induction on n. 

If n = 1 there is nothing to prove, since, if/ is of degree 1, and 

f = gh, 

where g, h are of degrees l, m respectively, 

l + m = 1 . 

Hence either g or h is of degree zero, and is therefore a unit, and the 
decomposition is a trivial one; that is, /is prime. 

We therefore suppose the result true for polynomials of degree 
less than n. Now let / be of degree n. If it is irreducible, then it is 
already expressed as a product of prime factors. If it is reducible, 

f = gh, 

where g, h are of degrees l, m, say, and 

l + m. = n (l<n, m<n). 

By hypothesis g and h can be expressed as products of a finite 
number of prime factors. Hence / can be so expressed. 

We now prove that all factorisations of / are equivalent. That is, 
if / is a polynomial of degree n which can be expressed in two ways 
as the product of prime factors 

/= «/i •••/•> / = /%!•••& in K), 

we prove that r — 8, and after a rearrangement of the factors, 

9i = Vifi (i=l,...,r). 
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We again proceed by induction on n. For n = 1 the result is trivial, 
since all polynomials of degree 1 are irreducible. Assume that the 
result is true for polynomials of degree less than n. Since 

a /i •••/r = fai ••• 9s> 

f x is a factor of g x or of ... g 8 [§ 6, Th. I], If it is a factor of the 
second polynomial, the same theorem tells us that it is a factor of 
g 2 or of Pg z ...g 8 . Continuing thus, f x is either a factor of some g i 
or of /?. This last conclusion is untenable, since f x has degree greater 
than zero, while /? is of degree zero. Hence, by re-ordering the factors 
g i% if necessary, we can say that 


9i = Vifv 


Since g x (as well as f x ) is irreducible, y x must be a unit. K[ x] is an 
integral domain, and so we can cancel the factor obtaining the 
equation , , 0 


between two polynomials of degree less than n. By our induction 

hypothesis , . . 

Jtr r— l=s— 1, that is r — s, 


and, after a suitable arrangement of the factors, 

9i = 7ifi (i = 2,...,r), 

where the y i are units. This proves the theorem. 


8. Factorisation in polynomial rings. The main theorem of 
this section, which we shall prove with the help of three lemmas, is 

Theorem I. If I is a u.f.d., so is I[x]. 

Let / be any non-zero polynomial in /[«], and suppose that d is 
the highest common factor of the coefficients of/. We can write 

f=dg, 

where g is a polynomial in I[x] whose coefficients have a unit as 
their highest common factor. Such a polynomial is called a primitive 
polynomial. It is clear that both g and d are uniquely determined, 
save for unit factors. The importance of primitive polynomials is 
brought out by the following lemma. 

Lemma 1. The 'product of two primitive polynomials is a primitive 
polynomial. 


up i 


3 
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Let / = a^+a x x+ ... + a n x!* t 

and g = b 0 +b 1 x+ ... + b m z m 

be two primitive polynomials, and suppose that the product fg is 
not primitive. Then the coefficients of this polynomial have a 
common factor d which is not a unit. If p is a prime factor of d, 
p must divide all the coefficients of fg. Now, / is primitive, and so 
not all the coefficients of / can be divisible by p. Let a r be the first 
coefficient which does not have p as a factor. Similarly, let b„ be 
the first coefficient of g not divisible by p. Consider the coefficient 
of af +s in the product fg. It is 

c r+» — a r^8 + °r— 1 ^«+l + a r+ 1 V- 1 +•••+«<) b r+s + d r+a b 0 

= a r b t +pc, 

where c is in I. By hypothesis c r+g has p as a factor, and therefore 
the product a r b s must be divisible by p. The prime factors of a r b t , 
since I is a u.f.d., are the prime factors of a r and of b s . Therefore 
either a r or b s is divisible by p, contrary to hypothesis. Our assump- 
tion that fg is not primitive is therefore false. 

Let K be the quotient field of I. By § 7, Th. Ill, we know that 
K[x ] is a u.f.d. We use this theorem, together with properties of 
primitive polynomials, to establish our theorem. Let <j> be any 
polynomial in K[x], and suppose that 

<p = a 0 + a 1 x+ ... + a n x n , 

where ol { = a { lb { (a { , b { in I). 

We now write the coefficients in <f> with a common denominator 

b = b 0 ... b n . 

If c i = b 9 ... 6i_x«i6 t+1 • • • b n , 

and /= c 0 + c 1 a;+... + c n a; n , 

then <f> = f/b, 

the polynomial /lying in I[x\. If now 

/ = ag, 

where g is primitive, we have expressed <j> in the form 
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Lemma 2. The primitive polynomial g is uniquely determined by 
the non-zero polynomial <f>, save for a factor which is a unit of I. 

Cb C 

Suppose that ^ g and also <f) = -^h, where g and h are both 
primitive in I[x]. Then 

adg = bch, 

and therefore ad is a factor of the coefficient of every term in bch. 
But since h is primitive, the highest common factor of the coeffi- 
cients is be. Hence , , 


where e is in I. Similarly, since be is a factor of ad, 

ad = e'bc, 

where e' is in I. Therefore 

bead = (ee')bcad. 

Since is not zero, and I is an integral domain, 

ee' = 1, 

and therefore e is a unit. Hence 


9 = eh, 


and the lemma is proved. 

If now the polynomial xjr in K[x] determines the primitive poly- 
£ 

nomial h, and rjr = ^h, then 




By Lemma 1, gh is primitive. Hence, by Lemma 2, the primitive 
polynomial determined by (j>\jr is gh. We therefore obtain 

Lemma 3. If <j> is an irreducible polynomial in K[x], the corre- 
sponding primitive polynomial in I[x ] is irreducible, and conversely. 

With these preparations we now come to the proof of our theorem. 
Let / be any polynomial in I[x], and let d be the highest common 
factor of the coefficients. Write 

f=dF, 

and consider F as a polynomial in K[x], which is a u.f.d. Let the 
prime factors of F in K[ x] be $ lt . . . , f> r . These are uniquely deter- 
mined, save for units of K. If 



3-3 
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where F t is the corresponding primitive polynomial in I[x\, then, 
by the above lemma, F t is irreducible in /[*]. We have 


F = a 1111 a 1I , 


F 

> -*• r) 


that is, 


. . . 6 , 

Aj » • • by F = flj » • • (ZyF^ . . . J^ r . 


Since F,F X , ...,F r are primitive polynomials, 6j...6 r is a factor of 
a x ... a r , and conversely. Hence 

a x ... a r = e6 x ... b r , 
where e is a unit of I, and therefore 

f — edF x ...F T . 

Writing ed = 8d x ... d s , 

where d v ...,d g are primes, the equation 

f = 8d 1 ...d s F 1 ...F r (8 a unit) 
expresses /as a product of prime factors. Now let 
/ = ee x ... e s . O x ... G,, (e a unit) 

be another representation of / as a product of irreducible factors 
in I[x]. By Lemma 3 the F t , Gj are irreducible in K[ x\, and therefore, 
since K[x] is a u.f.d., we must have r = r', and 

Q t = (a* a unit of K), 

when the factors have been suitably arranged. By Lemma 2 


<*i = e if 

where is a unit of I. Hence, finally, 

8d x ... d g — 66^ ... 6,' € x ... 6,, 

where 8 and ee x ... e r are units, whilst the d { , are not units. But 

I is a u.f.d. Therefore s = s', and after a suitable rearrangement 

of the factors, , « . 

di = {Oi a unit of I). 

This proves that /[a;] is a u.f.d. 

By a simple process of induction we then deduce 

Theorem II. If K[x v ...,x r ] is the ring obtained by adjoining the 
r indeterminates x v ...,x r tothe commutative field K, then K[x v ...,x r ] 
is a u.f.d. 
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9. Examples of fields. In constructing the spaces whose 
geometry we propose to discuss, the first step is always the selection 
of a field K. It is therefore appropriate to end this chapter with a 
few examples of fields. 

I. We have already noted that the complex numbers form a 
commutative field under the usual laws of addition and multi- 
plication. 

II. We know that the natural integers form an integral domain I. 
By § 4, this integral domain can be embedded in the quotient field 
of I, which is just the field of rational numbers. 

Both these fields are commutative and without characteristic. 
We now give an example of a field with finite characteristic. 

III. Let p be any prime integer. We saw in § 2 that the ordinary 
integers, reduced modulo p, form a commutative ring with unity. 
The zero 0 represents the set of integers which are multiples of p, 
and the unity 1 the set of integers equal to 1 (modulo p). We now 
show that every non-zero element of this ring has an inverse. If 
a is any integer not a multiple of p, a and p are mutually prime. 
Hence we can find integers b, c such that 


ab—pc = 1. 


If now a' = a (modulo p), b’ = b (modulo p), 

it follows that a’b’ = 1 (modulop). 

The set of integers modulop therefore form a field. This field is 
clearly of characteristic p. This example also illustrates the case of 
a field with a finite number of elements. 

IV. Let I be any integral domain with elements a,b , .... The 
quotient field K consists of elements a/b ( 64 = 0 ). The polynomial 
ring I[x\, consisting of polynomials 

a Q + a x x + . . . + a n x n , 

is an integral domain. There is also an integral domain K[x\, 
consisting of polynomials 


a n 


a i 


r^+ yi X+ ... + - 


a. 


h 


n x n 


( 6 q ... b n ^p 0 ). 


We show that the quotient fields of I[x] and K[x~\ are iso- 
morphic. 
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To each element 

f a 0 + q 1 a;+...+a n 
g b 0 + b 1 x+ ...+b m x m 

of the quotient field of /[«] there corresponds the unique element 
of the quotient field of if [a?] similarly represented. If/'/gr' is another 
representation of this element in the quotient field of /[#], so that 
fg' = f'g, the elements fjg and f'jg' are equal in both quotient fields. 
Conversely, any polynomial in K[x] 




b 0 bi 


can be written 


*- 


c 0 + c 1 a;+...+c n a: n / 


c c 

where = b 0 ... ••• b nJ c = b 0 ...b n . 


If, similarly, 




d 0 + d 1 x+. ..+d m x m g 


d 


d’ 


we have, in the quotient ring of if [a:], 

*L - ^( c O + c l a: +---+ c n a;W ) 

^ ~~ c(d 0 + d 1 a:+ ... +d m x m )’ 


so that 0/^ determines an element of the quotient field of /[a;]. 
If, similarly, 


and 

and 

then 


c'dfg' = cd'f'g, 


and therefore the element of the quotient field of I\x\ corresponding 
to (flxfi is the same as that corresponding jxjr' . The correspondence 

between the two fields is therefore one-to-one. It can be immediately 
verified that it is an isomorphism. 

The quotient field of if [a?] is denoted by K{x). It is called the 
field of rational functions, over if, of the indeterminate x. By the 
above argument, the field of rational functions of x % over the field 
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K{Xy) is isomorphic with (and can be identified with) the quotient 
field of K[x lt x 2 ], It is denoted by K(x v x 2 ). Proceeding, we see that 

K(x 1 ,...,x r _ 1 )(x r ) = K( x v ...,x r ) 

is isomorphic with the quotient field of K\x v ...,xf\. It is called 
the field of rational functions of x v . . . , x r over K. It is evident that 
the characteristic of this field is equal to that of K. 

V. Finally, we give an example of a non-commutative field. 

Let K be the field of the real numbers. Defining matrices as in 
§ 2, Example V, consider the set Q of matrices which are of the form 

( a 8 -p -y\ 

-8 a y -P 1 

P -y a -8 I' 

y P 8 a ) 

where a, /?, y, 8 are in K. Clearly, the sum of two matrices of Q is 
a matrix of Q, and if a, b are matrices of Q, the equation 

a + x = b 

has the solution x = b — a 

in Q. Thus the set Q forms a group under the addition law. Next, if 

a= / a 8 — P — y\ a* = / a* 8* — /?* — y*\ 

I -8 a y -p \ 1-8* a* y* — P* \ 

I P —y a —8 I’ l p* —y* a* —8*1 

\y p 8 a ) \y* p* 8* a* J 

the product 



where p — aa* — pp* — yy* — 88*, 

q = Pa* + aP* — 8y* + y8*, 
r = ya* + 8p* + ay* — p8*, 
s= 8a*—yP*+Py* + a8*, 
and therefore aa* is in Q. 

Since multiplication of matrices is associative, and distributive 
over addition, the set Q forms a subring of the ring of matrices. The 
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zero of Q is the matrix with a = /? = y = 5 = 0, and the unity of 
Q is the matrix with a = 1, /? = y = 8 = 0. If, now, a 4=0, and a* 
is given by 

a* = a(a 2 + /? 2 + y 2 + 5 2 ) -1 , 

P* = - / #(a 2 + /? 2 + y 2 -|-5 2 )- 1 ) 

y* = -y(a 2 + /? 2 + y 2 + £ 2 )-l, 

8* = — <?(a 2 + /? 2 + y 2 + £ 2 ) -1 , 

we find that 

aa* = 1 = a*a. 

Therefore every non-zero element of the set Q is regular : Q is a field. 
Let us write 1, i,j, k for the matrices corresponding to 

a = 1, /? = y = 5 = 0; /? = 1, y = 5 = a = 0; 

y = 1, 8 = a = P = 0; 5=1, a = /? = y = 0, 

respectively. Then the element a of Q written above is expressible 
in the form 

a = oc -t- pi + yj -t- 8k, 

and the product 

(a + pi + yj + 8k) (a* + p*i 4- y*j + 8*k) 
can be evaluated by multiplying out and using the relations 

i 2 = j 2 = k 2 = — 1, 
jk = —kj = i, 
ki = —ik = j, 
ij = -ji = k. 

In particular, since ij and ji are unequal, the field Q is not com- 
mutative. This field is usually called the field of quaternions. 
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LINEAR ALGEBRA, MATRICES, 
DETERMINANTS 

I n this chapter we investigate properties of a set of elements which 
can be combined according to certain rules to form elements of the 
same set. These rules involve the use of the elements of a field. In 
the applications which have to be made in later chapters we shall 
usually assume that the field used is commutative, and without 
characteristic. But the main results of this chapter are independent 
of any such assumption, and, unless an explicit statement is made 
to the contrary, our results will be true for any field whatsoever. 

We begin our investigation by selecting a field, which we denote 
throughout by K. We shall often refer to K as the ground field. 

1. Linear dependence. Having selected a ground field K, let 
L be a set of elements (not necessarily belonging to K) which have 
the following properties: 

(i) there is a law of composition in L, denoted by + , under which 
the elements form an additive group; the zero of the group is 
denoted by 0;, 

(ii) corresponding to each element of K there is an operation 
which transforms any element of L into an element of L. If a is the 
element of K, u the element of L, the result of operating on u with 
a is an element of L which is written au; 

(iii) the operations described above obey the following laws: 

(a) 1 u = w, 

(b) a(u+v) = au + av, 

(c) (a + b)u = au + bu, 

(d) a(bu) = ( ab ) u. 

A set L having these properties is called a left-hand linear set over K. 

From the definition, certain properties of linear sets can be im- 
mediately deduced. First, 

w+Ow = lw + Ow = (1 + 0) u= lu = u, 
for all u in L. Hence, for all u in L, 

Ou = 0. 



42 H. LINEAR ALGEBRA, MATRICES, DETERMINANTS 

Corollary. u+( — l)u= lw + (— 1 )m = 0m = O. 

Hence ( — 1 ) u is the negative of u as defined in the additive group. 
Secondly, if a is a non-zero element of K, 

u+aO -■ aa^u + aO 
= a(o _1 M+0) 

= a{a~hi) 

= u. 

Hence aO = 0. 

This result is also true if a = 0, by the first result. 

The converse of these results is immediate. If 

au = 0, 

and a #= 0, we have the equations 

u = lu = (a _1 o) u = a _1 (au) = a _1 0 = 0 . 

Hence, the equation au = 0 

implies either a = 0 or u = 0. 

Corollary. If au = bu and w=f=0, 

then o = 6; 

and if au — av and a =J= 0, 

then u = v. 

A right-hand linear set can be defined as above, except that all 
operations are performed on the right. We can deduce similar 
results to those proved above for left-hand linear sets. In the case 
of a commutative field K we shall not distinguish between right- 
hand and left-hand linear sets, and we shall write 

au = ua, 

and talk simply of linear sets. But in the following work we shall not 
assume that K is commutative. However, it will be sufficient to 
confine our attention to left-hand linear sets, since from each 
property of these sets a corresponding property of right-hand linear 
sets will be immediately deducible. Whenever there is no risk of 
confusion we shall simply call the left-hand Unear set L a linear set. 
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Any r members u v ...,u r of L are said to be linearly dependent 
over K if there exist elements a v ...,a r of K, not all zero, such that 

a x u t + ...+a r u r = 0; 

otherwise they are linearly independent. 

We note that no finite set of elements of L which includes 0 can 
be linearly independent, since, if the other members of the set be 
u v ...,u r , we always have the equation 

o0 + 0w, + . . . + OWj. = 0 (o =)= 0). 

If Mj, ...,u r are linearly independent, then any subset w^, 
consists of linearly independent elements. For the linear dependence 
of the subset would involve the linear dependence of the whole set. 
Any element v of L which can be expressed in the form 

v = a 1 u l +. .. + a r u r (G^iniT) 

is said to be linearly dependent on u v ...,u r . We shall denote the 
set of elements of L which are linearly dependent on u t , ...,u r by 
the symbol L(u v ...,u r ). The zero 0 is always in the set. We now 
prove 

Theorem I. L(u v . . ., u r ) is a linear set. 

If u, v are in the set, 

u = eqit, + . . . + a r u r , 

and v = b l u i + . . . + b r u r . 

Then ti + v = (cq + b^) Mj + .. . + (a r + b r ) u r . 

Hence u+v is in the set. Also 

au = aa x w, + . . . + aa r u r , 

and therefore au lies in the set. Since the negative of u in the 
additive group is equal to ( — 1 ) u, we see that the set L(u v ...,u r ) is 
an additive group and is closed under multiplication by elements 
of K. It is therefore a linear set. 

Theorem II. If u v ...,u r are linearly independent, the expression 
v = a x M, + ... +a r u r 

for any element of L(u v ...,u r ) is unique. Conversely, if the expression 
is unique , u v ...,u r are linearly independent. 
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If any element v of L(u v ..., u r ) can be written in two ways as 

v = a 1 u i + ...+a r u r , 
and v = u t + . . . + b r u r , 

we see that 0 = (a 1 — b 1 )u i + ... + (a r — b r ) u r . 

Since u v ...,u r are linearly independent, it follows that 

<*< = &* (* = 1,. ~,r). 

Conversely, suppose that the representation is unique, but that 

a i M i + ... +a r u r = 0. 

Then v = b 1 u t + ... + b r u r 

can also be written as 

v = {b-y + af) Ui + . . . + (b r + a> r ) Uy. 

From bi + ai = b t (i = 1, ...,r), 

we deduce that a { = 0 (i = 1, 

This proves that u v ...,u r are linearly independent. 

Theorem III. If v t ,...,v s lie in L(u v ...,u r ), and w v ...,w t 
lie in L(v v ...,v s ), then w v ...,w t lie in L{u v ...,u r ). 

For, from the equations 

v { = a n u t + ... +a ir u r (i = 1, ... ,s ), 

and b n v t + ... + b ia v s (* = 1, ...,t), 

we deduce that 

Wi = c a u t + ...+c ir u r (i = l,...,t), 

where * , 

% — li b ik a kj . 

k « 1 

Theorem TV. In any finite set u v ..., u r , not all 0, we can find a 
subset Ufa ...,Ui a , which is linearly independent, and such that 

...,M y) = L(U it , ...,Mj" g ). 

We prove this by induction on r. The theorem is evident if r = 1, 
since, by hypothesis, Mj #= 0, and is therefore linearly independent. 
Suppose now that the theorem is true for r — 1 elements, and con- 
sider the set Mj, ...,u r . If these elements are linearly independent 
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we choose them as the subset, and there is nothing to prove. Assume 
then that they are dependent, and that 

a 1 u t + ...+a r u r = 0, 

where at least one a t 4= 0. If a k =f= 0, we see that 
Ufa — — a k 1 a l u l — . . . — a k I 0’k-i u k-t ~ lo &+i w fc+i ~ ~~ a k 

Not all the elements u v . .., uj t+v ...,u r can be 0, otherwise 
Uff — 0 also. We can therefore apply the induction hypothesis to 
L(u v ..., W£_,, M* +I , ..., w r ), and we deduce that there exists a set 
of linearly independent elements w^, . . . , such that 

L{u v ...» j» ...yU r ) = L(Ui t , 

Clearly L{u^, ...,U{ t ) is contained in L(u v ...,u r ). Conversely, any 
element v of L(u v ...,u r ) can be written in the form 

v = byii^ + . . . + bfcU/f + . . . + b r u r 

= (bi - b k a k 1 aj M, + . . . + {b k - b k a k 1 a k ) u h + . . . + (6 r - b k a k 1 a r ) u r , 
and is therefore in 

i(M|, ...,U r ) = L(Uj t , 

Hence L(u v ..., u r ) = L(u ^ w,-J, 

and the theorem is proved. 

The elements v v ...,v s , whether linearly independent or not, 
are said to form a basis for the linear set L(u v ...,u r ) if 

L( v v ...,v s ) = L{u v ...,u r ). 

If the elements are linearly independent the basis is described as 
a minimal basis. We shall prove that the number of elements in a 

minimal basis for L(u t u r ) is the same for all minimal bases, 

and this number will be called the dimension of L(u v ...,u r ). 

The principal step in proving that the number of elements in a 
minimal basis is always the same is the result known as th ^Exchange 
Theorem. This has many applications in algebra. 

Theorem V. If v t , ...,v s are s linearly independent elements of 
L(u v ...,u r ), there is a subset w^, ...,w f - # of the basis u v ...,u r , such 
that if we exchange this subset for v v ...,v a the new set of r elements 
is a basis for L(u t u r ). 
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We prove this theorem by induction on a. Assume that the 
theorem is true for v v ...,v s _ v and suppose that these elements 

have been exchanged with u j sl , where i lt ...,i s ~ v is 

a derangement of 1, Then, by hypothesis, 

L{U |, ...,U r .) = i(®(, • • • > w,* r ), 

and contains v a . Hence 


v s — a l v i + • • • + ®a-i + a s u i, + • • • + a r u ir‘ ( * ) 

Not all the elements a s , ...,a r of K can be zero, since this equation 
would then imply the linear dependence of v s . We may, 

without loss of generality, assume that the derangement i v ...,i r 
has been chosen so that a B 4= 0. We can then write 

u i. = ~a B 1 a 1 v i - ... + a s “ 1 u s 

~ a 7 1 a s+l U i s+ 1 — • • • — a s" 1 a r M * r • ( 2 ) 


Now, v t V s _ t ,v s , U it+1 u ir lie in L{v t , w f> , ...,u ir ), 

and therefore, by Th. Ill, 

But conversely, equation (2) shows that w$ lies in 


and therefore 

L(v v s . u u it , v a , u ia+i , 

Hence 



L(u ^ , ...,u r ) L(v j, ^i,) 

= L{v 1 ,...,v s _ t ,v a , u ia+1 ,...,u ir ). 

The same reasoning can be applied to the case 8 = 1, when there 
has been no exchange. Hence the theorem is proved. 


Corollaries 

(1) Ifu v ...,u r is a minimal basis for L(u v ...,u r ), andv t , ...,v r 
are r linearly independent elements of this set, then 

L(u t ,...,u r ) = L(v v ...,v r ). 

(2) If v t , ...,«V+i are an V r + 1 elements of L(u v ...,u r ), they are 
linearly dependent. 
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If any r of the elements V{ are linearly dependent, the corollary 
needs no proof. Suppose, then, that®,, ...,t? r are linearly independent. 
Then v r+l is in L(u t , ...,u r ), which, by the Exchange Theorem, is 
the same as L(v t , ...,v r ). Therefore 

v r+i = «!«! + .. • +a r V r , 

and therefore i?,, ...,v r+l are linearly dependent. 

Note that this proof does not assume that w, u r form a 

minimal basis for L(u v ...,u r ). 

Theorem VI. The number of elements in a minimal basis for 
L(u v . ..,u r ) is the same for all bases. 

Let v v ...,v n , and w u . . . , w m be two minimal bases. Then 

M u l j • • •» Mf) = L(V j, . . .,V n ) = 

If n> m, Corollary (2) states that v v ...,v n are linearly dependent, 
and similarly, if m>n, w v . . . , w m are linearly dependent. But, by 
definition, the elements of a minimal basis are linearly independent. 
Therefore m = n. 

As stated above, this number is called the dimension of the linear 
set. As an evident corollary to the above theorem we have 

(1) If L, M are linear sets of finite dimensions l, m respectively, 
and L^M, then If L^M and l = m, then L = M. 

We now give an important example of a left-hand linear set over 
a field K. 

We consider the set of w-tuples 

(®1» • • • > a n) 

consisting of n elements a i of K. Two ^-tuples 
(a x , ...,«„) and {b v ...,b n ) 
are equal if and only if 

= bi (i = 1 n). * 

If u = (a v ...,a n ), and v = (b v ...,b n ) 

are two n-tuples, we define u+v by the equation 
u+v = («! + &!, ...,a n + b n ), 
and an (a in K) by the equation 

au = (aa v ...,a&f). 
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With these definitions it is evident that w-tuples form a left-hand 
linear set. If 

( 1 , 0 , 0 ,... , 0 ), 

11 % = ( 0 , 1 , 0 ,..., 0 ), 


u n = (0,0,0, ...,1), 

then u v ...,u n are linearly independent, and any w-tuple 
u — (a x , ...,a n ) is given by the equation 

u = a 1 u 1 +... +a n u n . 

Hence the set of ^-tuples forms a linear set of finite dimension n, 
for which u v ...,u n are a minimal basis. 

The elements of this linear set are usually called left-hand vectors, 
or left-hand n-vectors when we wish to emphasise the dimension of 
the set. We say that the set of vectors forms a left-hand vector 
manifold. In the same way we can define right-hand vectors, and, 
when the ground-field K is commutative, simply vectors. 

Theorem VII. Any left-hand linear set L(u v ...,u n ) of finite 
dimension n is isomorphic with the set of left-hand n-vectors. 

Since the linear set is of dimension n, u lt ...,u n are linearly 
independent. Hence [Th. II] any element u of the set can be written 
in a unique way as 

u = a{U t + ...+a n u n . (3) 

There corresponds to u, therefore, the unique vector 

(ctj, . ,.,a n ). 

Conversely, the n-vector above determines the unique element 
of the linear set given by (3). The proof that this one-to-one corre- 
spondence is an isomorphism is trivial. 

It will be noted that the correspondence between the linear set 
and the set of n - vectors depends on the choice of a minimal basis 
in the linear set. 

In the remainder of this chapter, and in the applications to be 
made later, we shall have to consider simultaneously the properties 
of a finite number of subsets, each of finite dimension, belonging to 
some linear set L. Now, if we choose a minimal basis for each of 
these subsets, we have an aggregate which consists of a finite number 
of elements of L. These elements define a subset L' of L, of finite 
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dimension, which contains each of the given subsets. All our 
operations will take place in L' . We may always assume, therefore, 
that we are dealing with a linear set L' of finite dimension. By 
Th. VII, L' is isomorphic with a vector manifold. For this reason we 
shall usually speak of the elements of L' as vectors, and in this sense 
the discussion in the following paragraphs is related to vectors. 
We shall not state the dimension of the manifold of vectors explicitly, 
since our results will not depend on it; but on occasion it may be 
necessary to assume that the dimension is ‘sufficiently high’. 

2. Matrices. A matrix of p rows and q columns over a field K 
(called, for brevity, a, pxq matrix) is an array 



of pq elements of K, arranged in rectangular form. In § 2 of Chapter I 
we met, by way of example, the special case in which K is the field 
of complex numbers, and p — q. 

The theory of matrices is closely related to the theory of vectors, 
and we shall use the results of the preceding section to develop 
properties of matrices. The connection with vectors is established 
in this way. We consider in some vector manifold of finite but 
sufficiently high dimension, which need not be specified, a set of q 
left-hand vectors u v ...,u q . By means of the matrix A we can 
determine from these q vectors p vectors v x , ..., v p where 

v i = a n u 1 + ...+a iq u q (i '= 1, ...,p). (1) 

Then L{v v ...,v p )^L(u x , ...,u q ). (2) 

The individual vectors v i are uniquely determined from the vectors 
u t by means of the matrix A . The set v v ...,v p is called the transform, 
of the set u v ...,u q by A. We notice, on the other hand, that if we 
are given condition (2) the matrix A is uniquely determined if and 
only if u x ,...,u q are linearly independent. This is an immediate 
consequence of § 1, Th. II. 

Sets of equations such as (1) will occur very often in this book, 
and it is convenient to have an abbreviated notation for them. If 
we generalise the notion of a matrix so that the elements may be 
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vectors instead of elements of K, and let U, V be respectively the 
q x 1 and p x 1 matrices 



we write (1) in the form 

V = AU. (3) 

We shall see later that in writing A U for the pxl matrix 


( a ll w l+ ••• + a lq u q 

a pl u 1 + ...+a pq u ql 


we are using a special case of the law of multiplication of matrices. 
Finally, in writing a q x 1 matrix, such as U, in full, it is sometimes 
typographically convenient to write 


U = Ug\, 


the square brackets indicating that the elements enclosed should 
be read as a matrix of one column. 

Now let B be any other p x q matrix over K, and use it to define 
p vectors w v ...,w p , where 

[«?!,.. .,w p ] = BU. 

Then 

Vi + Wi= («i i + b n ) % + ... + (a iq + b iq )u q (i = 1 p). 

These equations can be abbreviated to the equation 


V+ W = (A + B) U, 


if the sum of two pxq matrices is defined by the equation 



a lS + &lg' 
a 2a + ^2a 


a PQ 
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With this definition of addition for p x q matrices, we see that 
pxq matrices over K form an additive group. The zero of the group 
which, without risk of confusion, may be denoted by 0, is the zero 
pxq matrix, that is, the pxq matrix whose elements are all zero. 

Now let A be a p x q matrix over K, and let B be a q x r matrix 
over K. Let %,..., u r be r vectors and v 1} ..., v q the vectors defined 
by the equation y _ ^ 

where now U = [u v . . . , tt r ] and V = [v v ...,t> 9 ]. 

Similarly, consider the set of vectors 

W = ,w v \ 

defined by the equation 

W = AV. 

Then L(w lt ...,w p )£ L{v v L(u v . . . , u r ), 

and hence there must be a p x r matrix G such that 

W = CU. 

As in § 1, Th. Ill, we may take 

C - (ct,), 

a 

where c i} = I,a ik b kj (i = 1, ...,p;j = 1, ...,r). (4) 

k = 1 

If we formally eliminate V from the equations 

W = AV, V = BU 
by writing W = A(BU) = ABU, 

this suggests that we define the product A B of A and B to be the 
pxr matrix G, whose elements are given by means of (4). We see 
that the product of a p x q matrix and anrxs matrix is only defined 
when q = r, and that the result is a p x s matrix. If A B and BA are 
both defined they are not necessarily equal. 

We note here that AB is said to be obtained from A by post- 
multiplication by B, and that AB is obtained from B by pre- 
multiplication by A. 

Matrices do not form a ring, since the sum of two matrices is 
only defined when each is a p x q matrix, and the product is only 
defined when the number of columns in the first matrix is equal to 
the number of rows in the second matrix. If we restrict ourselves to 
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square nxn matrices we do obtain a ring, as we shall see in the 
next section; but certain associative, distributive and commutative 
properties also hold for all rectangular matrices. 

Let A, A v A a denote pxq matrices, let B, B v B a denote qxr 
matrices, and let C denote an rxs matrix. If 

BG = D, AD = E, 

r 

then d i} = £ h u c ti (i = 1, ...,g; j = 1, ...,s), 

t-i 

and therefore 

Q 

= S ^iu^uj == 1) '"iPii = !>•••» 
u — 1 


9 r 

= S S a iu^ut C !} 

u-lt-1 

r 

= S fa c t}> 

t - 1 

where F — (f i{ ) — AB. Hence 

A(BG) = (AB)G, 

and the product can be written as ABC without ambiguity. This 
is the associative law for the multiplication of matrices. 

By direct calculation we may easily prove that 

A(B i + B a ) = AB 1 + AB a , 

and (A 1 + A 2 )B = A l B + A a B. 

These are the distributive laws for matrices. 

If A is a pxq matrix, and i v ...,i p ., j v ...,j q - are any integers 
satisfying the relations 

1 <i 1 <i 2 ...<i p .<p, 

Kj !<;«••• 3. 

we can obtain from A a new matrix of type p' x q' by striking out 
all the elements which do not lie in the 

ijth, i 2 th, ..., or ^.throws, 

and in the ^th, j 2 th, ..., or j^th columns. 


The matrix we obtain is called a submatrix of A. 
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Now let p v . . . , p 8 , q v . . . , q t be s + 1 positive integers such that 

8 t 

i i 

Let the submatrix of A whose elements are in the 

^E^fc + ljth, +2jth, ^S^fcjth rows, 

and the ^ £ 9k + ijth, ^ E ?* + 2^th, | £ (Zfcjth columns 

be denoted by Ay. The submatrix Ay is a p t x q } matrix, and in 
an obvious notation we can write 



The matrix A may then be described as divided into blocks, corre- 
sponding to the numbers p v ...,p 8 , q v ..., q t . 

If B is a qxr matrix, let r v ...,r u be a set of positive integers 

U 

such that £ = t. We may then divide B into blocks corresponding 

i 

to the numbers q v ...,q t ; r v ...,r u , and write 



Now, A ik is & Pi* q k matrix, and B kj is a q k xr f matrix. Hence 
A ik B k] is defined, and is a p i x r } matrix. We can therefore define 
a Pi xr j matrix Gy, given by the equation 

@ij = S A ik B k j. (5) 

k - 1 

Consider the element in the Zth row and mth column of Cy. It is 

Q 

S ^hn^nk’ 
n — 1 

t-1 j-1 

where h = p a + 1, k= 2 r a +m. 

o = 1 o=l 

Thus /C n . C lu 

W . 0 m 

is simply the division of C = AB into blocks corresponding to the 
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numbers p v r v ...,r u . We may therefore regard A and B as 
8xt and txu matrices whose elements are matrices, and their 
product is defined by (5). This is a definition similar to that given in 
(4). But it must be remembered that (5) is only possible when the 
submatrices of A and of B are chosen in the way we have indicated. 

Similar, but simpler, reasoning shows that if A and B are two 
pxq matrices divided into blocks corresponding to the same set of 
numbers, thus 

A = /A n . u4 x A , B = 

V^8l • AgtJ 

then A + B = /A n + B n . 

\-^8l "h -®sl • 

The device of considering matrices divided into blocks is often used. 
The operations discussed above are the only ones which will be 
employed, and the matrices concerned will always be matrices with 
elements in a field. 


/B u 

B lt ' 

Ux • 

Bgt, 


^8t “t” ^8il 

!' 


3. Square matrices. A matrix in which the number of rows 
is equal to the number of columns is said to be a square matrix. If 
we consider only nxn matrices, we see that two such matrices can 
always be added or multiplied, and the result is an n x n matrix. 
The properties demonstrated in §2 enable us to say that nxn 
matrices over K form a ring. If we refer to I, § 2, Example V we see 
that the ring is non-commutative, and has divisors of zero. But it 
has unity, namely, the unit matrix 


where 

and 


In ~ /I ^ • 0\ — 

0 1 . 


.0 0 


= 0 (i * j)» 

$ a =1 {i — 1, ...,n). 


The symbol $ {j defined in this way is frequently used in algebra. 
It is called the Kronecker delta. 
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In accordance with the terminology of I, § 3, an n x n matrix A 
is said to be regular if it has an inverse A~ x such that 

AA - 1 - A~ X A = I n . 

From the theory of rings developed in the previous chapter, we 
know that a matrix cannot have two distinct inverses. We also 
see that if both A and B are regular, having inverses A~ x , B~ l 
respectively, then AB is regular, with inverse B~ x A~ l . For 

(. AB ) (B~ X A~ X ) = ABB-'A- 1 = AI n A~ x = AA~ X = I n , 

by the associative law. Similarly, 

(B-'A-')(AB) = I n . 

We now obtain two theorems giving necessary and sufficient 
conditions for annxn matrix A to be regular. 

Theorem I. An nxn matrix is regular if and only if it transforms 
a set of n linearly independent left-hand vectors into a set of n linearly 
independent vectors. 

( 1 ) Let u v . . . , u n be a set of n linearly independent vectors. 
Suppose that the n vectors 

v i = a il u 1 + ... +a in u n (i=l,...,n) 
are linearly independent. By § 1, Th. V, Cor. (1) 

B(v ... t v n ) = L(u v ...,u n ). 

Hence u { = b il v 1 + ... + h in v n ( i = 1, 

where b if is in K. Then 

n n 

Vi = 2 aqb, x t>i + ... + s a ijb jn v n , 
j *i i “l 

n n 

and Ui = f ••• + S bija Jn u n . 

i-i j-i 

Since both the sets u v ...,u n and v v ...,v n are sets of linearly in- 
dependent vectors, each of the two equations above must be an 
identity. Therefore 

n n 

£ a ijbjk = $ik> an d £ b i} a ik = 8 ik . 
l-i l-i 

In matrix notation this is simply 

AB = I n = BA. 

Therefore A = (a {] ) has an inverse B, and is regular. 
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(2) Now suppose that A is regular, and B is its inverse. Let 
u v ...,u n be a set of independent vectors, and let 


v t = an + +a in u n (i=l,...,n); 
then L(v v ...,v n )^L(u v ...,u n ). 

Now b il v 1 + ...+b {n v n 

n u 

= S b ij a jl u 1 +...+ S b i} a jn u n 
j — 1 j - 1 

*= ••• +^in w n 


Hence -£(%, ...,u n )^L{v v ...,v n ), 

and therefore L(% 1( ...,u n ) = L(v x , . . . , v n ). 


Since u v ...,u n are hnearly independent, it follows from § 1 , Th. VI, 
Cor. 1, that v v ...,v n are linearly independent. This completes the 
proof of the theorem. 

If G is any pxq matrix „ . . 

^ ““ \ G ij)f 

we may regard the p rows of G 

( c il> •••> ^iq) 


as p left-hand g-vectors. We speak of these as the q-vectors given by 
the rows of C, and we shall always regard them as left-hand vectors. 
Our second criterion for a regular matrix is given by 

Theorem II. An nxn matrix A is regular if and only if the 
n-vectors given by the rows are linearly independent. 

As in the previous theorem we denote n linearly independent 
vectors by u v ...,u n , and the transforms of these by A by v v ...,v n . 

(1) Suppose that the ^-vectors {a a , ...,a in ) are linearly in- 
dependent. Then if 

n 

l,b { a ii = 0 (j = 1, ...,»), 

i-1 

we must have b { = 0 (i = 


Now if v v ..., v n satisfy a relation 

b 1 v 1 + ...+b n v n = 0 ^inA), 

this implies that 

n u 

S biO'n'U'i + ...“I" 2 
i-1 i-1 
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3 . SQUARE MATRICES 
Since u v ...,u n are linearly independent, 

n 

2My = ° 0'= i, 

and therefore, by the above, 

6, = 0 (i=l,...,n). 

Therefore v v ...,v n are linearly independent, and, by Th. I, A is 
regular. 

(2) Suppose that A is regular. Then v v ...,v n are linearly in- 
dependent. Now, if the n- vectors (a iv . . . , a in ) are linearly dependent, 
there exist elements b v ..., b n of K, not all zero, such that 

n 

2My = ° 

i - i 

Then b 1 v 1 + ... + b n v n 

n n 

= 2 b i a il u 1 + ... + 2 bi(i in u n 

i-l i - 1 

= 0 . 

This implies the linear dependence of v v ...,v n . From this contra- 
diction we deduce that the n-vectors given by the rows of A are 
linearly independent. 

The methods used above also give us 

Theorem III. If A is a pxn matrix, and the left-hand n-vectors 
determined by the rows of A determine a vector manifold of dimension r, 
the manifold L(v v ...,v p ), where 

Vi = a t 1 u 1 + ...+a in u n 

is also of dimension r, provided that u v ...,u n are linearly independent. 
If the vectors defined by the rows of A be 

«<= (aa»—» a in) (*= 


then a { , ...,a it , the vectors of any subset of the vectors a v ...,a p 
are linearly dependent or independent according as the corre- 
sponding vectors v {l , are linearly dependent or independent, 
and conversely. For 

_ S Mi = .2. 2 = 0 

ii, it, ...,U jf*™l 


if and only if 


2 My = 0 = 


since u v ...,u n are linearly independent. 
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It follows that if the maximum number of independent vectors 
in the set a v ...,a p is r, the maximum number of independent 
vectors in the set v v ..., v p is also r. This proves the theorem. 

Theorem IV. If A is a pxn matrix, and the p left-hand vectors 
determined by the rows of A are linearly independent, then p^n, and 
there exists an (n —p) x n matrix B such that thenxn matrix 

0 

is regular. 

We have seen that a minimal basis for the manifold of n-vectors 
is given by the n vectors 

($iV •••» ^in) (* = 1> •••» n )' 

It follows that the maximum number of linearly independent 
n-vectors is n. Therefore p<n. Now let u v ...,u n be n linearly 
independent vectors and let 

v i = a n «i + — +a in u n (i=l,...,p). 

Then, as in the above theorem, v v ...,v p are linearly independent. 
Applying the Exchange Theorem [§ 1, Th. V], we deduce that there 
is a derangement i v . . . , i n of the numbers 1, ...,n such that if 

Vj = u i} (j =p+l,...,n), 

then v v . . . , v n form a basis for L(u v u n ). This basis is necessarily 

minimal. Hence, by Th. I, 

Vi = c il u l +...+c in u n (i=l,...,n), 
where C = (c y ) is regular. But 

where B is the ( n—p)xn matrix whose Mh row is 

^ip+hl’ "’f^ip+hn)' 

This establishes the theorem. 

4. Transformations of a matrix. Now let A be a p x q matrix. 
If u x ,...,u q are q linearly independent vectors, A transforms them 
into p vectors, v v ..., v p , say, and 

L(v v ...,v p )^L{u 1 ,...,u q ). 

If we are concerned with the vector manifolds L(v v and 
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L(u lt u Q ), we are at liberty to change the bases in these manifolds 
without altering the manifolds themselves. Since u v ...,u Q are 
assumed to be linearly independent, a new set of vectors u* y 
forms a minimal basis for L(u v ..., u q ) if and only if 

U=QU*, (1) 

where, in the notation of §1, U = uf\, U* = [«*, 

and Q is a regular qxq matrix [§ 3, Th. I]. 

If v v ...,v p are linearly independent, a similar argument applies 
to L(v v ...,v p ). The set vf, is then a minimal basis for 

L(v v . . . , v p ) if and only if ym = py ^ 

where P is a regular pxp matrix. This result requires modification 
if v v ...,v p are not linearly independent. But in the case we are 
considering P is a regular pxp matrix, and if 

V* = PV, then V = P~W*, 

and therefore 

L(vf, L(v v ...,v p )^ L(vf , . . ., v*), 

so that L(vf , . . ., v*) - L(v v ...,v p ). 

Hence, in order that the equation (2) should represent a change of 
basis in L(v v ...,v p ), it is a sufficient condition that P should be 
regular. Simple examples show that it is not a necessary condition 
when v v ...,v p are not linearly independent. 

Equations such as (1) or (2), where Q and P are regular matrices, 
are said to define allowable transformations of the bases u v ...,u q 
and v v ...,v p of L(u v ...,u q ) and L{v 1 ,...,v p ) respectively. These 
allowable transformations replace the matrix equation 

V = AU 

(which connects v v ...,v p with u v ...,u q ) by 

V* = A*U*, 

where A* = PAQ. 

We note that 

(l) A “ l p Al qi 

(ii) if A* = PAQ (P and Q regular), 

then A = P-'A*Q~ l \ 

(iii) if A* = PAQ and A** = P*A*Q*, 

then A** = (P*P)A(QQ*). 
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Thus the relation between A and A* is reflexive, symmetric and 
transitive. It is therefore an equivalence relation [I, §2]. Our object 
in this section is to determine a set of canonical forms for pxq 
matrices. In other words, we shall obtain a set of p x q matrices 
with the properties 

(1) No two matrices of the set are equivalent, in the above 
sense. 

(2) There is a matrix of the set equivalent to any pxq matrix. 
We first establish the existence of a numerical character which is 

invariant under our equivalence relation. 

Let r A denote the maximum number of q- vectors given by the 
rows of A which are linearly independent. If these g-vectors are 
all zero, r A = 0. By § 3, Th. Ill, we see that r A is also the maximum 
number of linearly independent vectors of V. That is, the dimension 
of L(v j, ...,v p ) is r A . But since 

L(v v ...,v p ) = L(vf, ...,«*), 

it follows that r A is also the number of linearly independent vectors 
in the set v * , . . . , v*. Therefore r A =, r A %. We notice that the equations 

V* = PAU, V* = PAQU* 

determine the same vectors of 

L{u lf ...,u q ) = L{uf,...,u*). 

Therefore the number of independent rows of PA is equal to the 
number of independent rows of A* = PAQ, and both these are 
equal to the number of independent vectors in the set vf, 
which is r A . Hence, calling the number r A the row-rank of A, we 
have 

Theorem I. The row-rank of a matrix A is equal to the row-rank 
of PAQ, where P and Q are regular matrices. 

This includes the cases in which Q = I q , P = I p , when we have, 
respectively, the matrices PA, and AQ. Pre-multiplication or post- 
multiplication of A by a regular matrix does not alter its row-rank. 

Now, the replacement of the vectors v v . . ., v p by the same vectors 
v ix , . . . , in a different order is a transformation 

Vi = PiV, 

where P x is a regular matrix. For, since every permutation of the 
numbers 1 ,...,p has an inverse permutation, P x must have an inverse. 
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We choose P± so that the first r A permuted vectors are linearly 
independent. Let us denote the permuted vectors by 

Hi 0'= L-, 2>). 

Then since the vectors 


Vij {j = r A +l,...,p) 

are linearly dependent on the first r A , we have the equations 

»u 



the matrix P 2 is regular. If we now put 

f 2 = P 2 F X , 

calling the transformed vectors v 2x , ...,v 2p , then of these, the 

(i = l,...,r A ) 

are linearly independent, but the rest are zero; that is, 
v z.i = ° (j = r A + l,...,p). 

Since F 2 = P 2 F X = P 2 P X F = P 2 P 1 AU 

= A 2 U, 


say, and u v ...,u q are linearly independent, the last p — r A rows of 
A 2 must be zero. We therefore write 



where C is a r A x q matrix. 
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By the Exchange Theorem we can replace r A of the vectors 
Mj, ..., u a by the linearly independent vectors v 21 , ..., v 2 fu , and thus 
obtain a new basis for L(u lt ...,u Q ). Before doing this we make a 
transformation jj _ q jj 

which merely permutes the vectors u lt ...,u Q , so that those to be 
replaced are the first r A vectors. The exchange is then represented 
by another transformation 

U x = Q,U t> 

where Q 2 is regular. Hence, finally, we have 


— A 2 Q X Q 2 U 2 
- U t ; 

that is, V 2 = (D\ U 2 , 

\0/ 


say, where D = (d i} ) — CQ t Q 2 . Now consider the first r A of the 
p equations given by this last matrix equation. Since 


we have 
Since u 21 , 

Hence 
That is, 


= (i = l,-~>r A ), 

Q 

u 2 ,i — 2 dijU 2 j (i = 1, ...,r A ). 
i "i 

u 2 a are linearly independent, we must have 
di] = $ij (i = 1> •••»^)> 

oto :)• 

P 2 P 1 AQ 1 Q 2 = ^i rA Oj 


Since Pv P%> Q v Q 2 are regular matrices, 

P = P%Pi and Q — Q X Q 2 


are regular. We have therefore proved 

Theorem II. If A is a pxq matrix of row-rank r A , A is equivalent 
to the matrix 


PAQ - l I rx O' 
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It is sometimes convenient to effect the transformation (or 
reduction) of A to this canonical form by a series of elementary 
steps, or elementary transformations. We define these as the following 
transformations of the basis w lt ...,w r (not assumed to be minimal) 
of a vector manifold L(w v . . . , w r ) : 

(i) the interchange of two elements w { , Wp 

(ii) the multiplication of an element w t of the basis by a non-zero 
element of K\ 

(iii) the addition to any element of the basis of any vector linearly 
dependent on the remaining r — 1 elements of the basis. Each of 
these transformations of the basis is called an elementary trans- 
formation, and is represented by the equation 

W* = RW. 

The matrix R is regular, for the inverse of an elementary trans- 
formation is easily constructed, and is itself an elementary trans- 
formation. It follows that the result of any number of elementary 
changes of basis replaces the basis by an equivalent basis, and is 
reversible. 

We now consider the equation 

V = AU, 

Q 

that is, = £ a i} u } (i=l,...,p). 

Unless r A = 0, there is at least one element of A, say a i} , which is 

not zero. By performing an elementary change of basis of type (i) 

on the bases u v ...,u q and v x v p , we obtain new bases, which we 

denote by , 

u 1>v ...,Ui ta and v 1>v ..., v liP , 

and a relation V x = A X U X {A x = (% t fj), 

where a lu = a i} =t= 0. By a transformation of type (ii) applied to 
u x l we may arrange that a 1>u = 1. We then perform the trans- 
formation of type (iii) on u n , u lQ which is given by the equations 

3 

U 2,l = M l,l + S a i,l i U l,t> 

<- 2 

u w = u i,i U = 2 > •••>?)» 
so that we now have the matrix equation 


F x = A 2 U t , 
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where A a — /ii 0\ 

\0 b}> 

and C is a (p — 1) x 1 matrix, 

G = (c {J ) (i = 2,...,p, j = 1). 

Now perform the transformation 

= w i,i> 

v 2 , i = v 1>i -c {1 v 1>1 (i = 2,...,p), 

on v 1>v ...,v l p . This is a series of transformations of type (iii). We 
obtain the equation 

v a = A a u a , 


where 



Since the number of independent rows of A 3 is still r A , it follows at 
once that B is of row-rank r A — 1. 

We now show by induction that the matrix A can be reduced to 
canonical form by elementary transformations on the bases of 
L(v lt ...,v p ) and L(u v ...,u g ). There is nothing to prove when r A — 0, 
since A must then be a zero matrix. Let us suppose that the theorem 
is true for any matrix whose row-rank is less than r A . By what has 
been proved above we can reduce A , by elementary transformations, 
to the form 



where B is of rank r A — 1 . By the induction hypothesis we can reduce 
B to canonical form by elementary transformations applied to 
v 2 2 , ...,v 2 p and u 2 2 , ...,u 2 a . But these transformations can also be 
regarded as elementary transformations applied to v 2 l ,v 2 2 , ...,v 2 p 
and This proves the theorem. 

As a corollary we have 

Theorem III. Any allowable transformation 

U* = PU 


of the basis of a vector manifold L(u v ...,u q ) of dimension q is the 
result of a finite succession of elementary transformations. 

For we can apply the result proved above to the set of vectors 
and to the set u v ...,u Q , and reduce the matrix P to 
canonical form. Suppose that a succession of elementary trans- 
formations, corresponding to matrices on the vectors 
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u*, together with a set of elementary transformations, cor- 

responding to matrices R 1 ,...,R b , on the vectors %, ...,u q , trans- 
forms the first set of vectors into v*, and the second set into 

v v ...,v q . Then if the matrix expressing the relation between the 
vf and the v i is in canonical form, we have the equations 

vf = v { (i = l,...,r P ), 

But since P is regular, r P = q. Therefore the transform of 

U* = PU 


is 


V* = I q V. 

But 

y* 

= QaQa-1 ••• Ql U*9 

and 

V 

= R b R b „i ... R 1 U, 

Hence 

Qa Qa-1 

...Q^R^.-.R,^ 

that is, 

P = 

Ql 1 • • • Qa 1 




Since is the matrix corresponding to an elementary transforma 
tion, so is Qf 1 . This proves the theorem. 


5. The rank of a matrix. Until now, our theory of matrices 
has been based on the consideration of left-hand vectors, and in 
particular we have been led to consider the rows of a p x q matrix 
as left-hand q- vectors. We now consider how our results would be 
affected if we based our investigation on right-hand vectors. 

The theory of right-hand vectors is immediately deducible from 
the results of § 1. If x v ...,x p ar ep such vectors, they are said to be 
linearly independent over K if the equation 


x 1 a 1 + ... + x p a p = 0 (a* in If) 

implies the equations 

= 0 (i = l,...,p). 

Also, if 

y = x 1 a 1 + ...+x p a p , 

y is said to be linearly dependent on x p . We then prove the 

analogues of § 1 , Th. I- VII, noting in Th. VII that a set of right-hand 

vectors of dimension p is isomorphic with the set of right-hand 

©-tuples . . 

1 r (%,..., a p ), 

in which multiplication on the right is the only kind admitted. 


IT O T 
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A pxq matrix A transforms the p right-hand vectors z v ...,x p 
into the q vectors y x , .... y q , where 

p 

y i =Zx i a ij (j = 1, •••>?)• 

Proceeding from this equation we can deduce results similar to those 
obtained in §§ 2-4. The most important one from our point of view 
is that if x v ...,x p are linearly independent, the number of linearly 
independent vectors in the set t/ x , ...,y 9 is the number of linearly 
independent columns of A, regarded as right-hand vectors. We call 
this the column-rank of A. Just as in §4 we show that if A is of 
column rank s A , there exist regular matrices P and Q such that 

:)■ 

But we saw in § 4 that the row-rank of A is equal to the row-rank 
of PAQ, which in this case is clearly s A . Hence we have 

Theorem I. In any pxq matrix A the number of linearly inde- 
pendent rows, regarded as left-hand vectors, is equal to the number of 
linearly independent columns, regarded as right-hand vectors. 

This number we now call the rank of A. We recall that by §3, 
Th. II, an nxn matrix is regular if and only if its rank is n. 

The rows of A may be considered as right-hand vectors, and the 
columns as left-hand vectors if we interchange rows and columns. 
We define the transpose A' of A as the qxp matrix 

A' = (a'ij)) 

where a' i} = a H (i = 1 , ...,q; j = 1 , ...,p). 

Clearly, the number of linearly independent rows of A' , regarded as 
left-hand vectors, is equal to the number of linearly independent 
columns of A, regarded as left-hand vectors. The number of linearly 
independent columns of A', regarded as right-hand vectors, is 
equal to the number of linearly independent rows of A, regarded as 
right-hand vectors. By the theorem above these numbers are equal, 
and give the rank of A' . The consideration of the case when K is 
commutative, and left-hand and right-hand vectors are identical, 
gives us 

Theorem II. If the ground field K is commutative, the rank of a 
matrix A is equal to the rank of its transpose A'. 
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On the other hand, if K is not commutative, this theorem is not 
necessarily true. For K contains at least two elements a, b such that 

ab =f= ba. 

Neither a nor b is zero. Consider the 2x2 matrix 



A = 11 a \ 


\6 ab)' 

If 

p{l,a)+q(b,ab) = 0, 

then 

p + qb = 0, 

and 

pa + qab = 0. 

Hence 

— qba + qab = 0, 

that is, 

q(ab — ba) = 0. 


Since if is a field, and ab 4= ba, we deduce that q — 0. Therefore 

p = —qb = 0. 

Hence the number of independent rows of A is 2, and its rank is 2. 
On the other hand, the equation 

(1 ,a)p + (b,ab)q = 0 

is satisfied by p = b, q = — 1, and so the rank of A' is 1. 

Our final theorem in this section is often useful: 

Theorem III. If A is a pxq matrix of rank r, A contains a sub - 
matrix of type rxr which is of rank r. No submatrix of A is of rank 
greater than r. 

If A is of rank r it has r linearly independent rows. Every other 
row of A is linearly dependent on these. If the submatrix defined 
by the r independent rows be denoted by B, B is an r x q matrix 
of rank r. Recalling the proof of § 3, Th. IV, we see that there exists 
a (q-r)xq matrix G such that the matrix 



is regular, where the Ath row of C is 

(tj, ...,i q ) being a derangement of (1, ...,q). Since D is regular, its 
rank is q, and hence its columns are linearly independent, regarded 
as right-hand vectors. In particular, the i x th, . . . , i r th columns are 
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linearly independent. These columns contain only zeros in the last 
(q—r) rows of D. It follows that the columns of the rxr submatrix 
E, which consists of the i x th, . . . , i r th columns of B, are independent. 
That is, E is of rank r. Hence the first part of the theorem is proved. 

If F is a submatrix of rank a > r, it contains a independent rows. 
Let these lie in the j x th, ...,jf 8 th rows of A. Since a>r, there exist 
elements 6 i of K, not all zero, such that 

= 0 (i = ji, . j — 1, . • •><?)• 

But since this summation includes values of j corresponding to the 
columns of F, it follows that the rows of F assumed to be inde- 
pendent are linearly dependent. From this contradiction we deduce 
the second part of our theorem. 


6. Homogeneous linear equations. Let A be a p x q matrix 
over the field K. Consider the g-tuples (x v ...,x q ) which satisfy the 

equations a il z 1 +...+a tq x a = 0 (i = 1, (1) 

If y = {y v .-.,y g ), and z = (z v ...,z q ) are two ^-tuples satisfying (1), 
th ^ n yb + zc = {y 1 b + z 1 c,...,y q b + z q c) 


also satisfies (1), for any b and c in K. It follows that the ^-tuples 
(z v ...,x q ) which satisfy (1) form a right-hand vector manifold. Any 
vector of this manifold is called a aolution of the left-hand linear 
homogeneoua equations ( 1 ), in which (x v ..., x Q ) is called the unknown. 
We write the equations (1) in matrix notation as 

Ax = 0, (2) 


where x — [x v ...,x q ]. 

If A is of rank r there exist regular matrices P and Q such that 


PAQ = 



Let y = [y v be a solution of (2), and let 

2 = Q~ x y- 

Then 0 = PAy 

= PAQz 

t^l » • • • j b]* 

z* = 0 (i = 1, ...,r). 


Hence 
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On the other hand, let z r+1 , ...,z q be any elements in any extension 

of K, and let r . , 

z [0, . . 0, z r ^ f . . ., zj, 


and let 
Then 


y = Qz. 

Ay = P^PAQz 


= P- 1 //. 
= 0. 


ft 3 : 


Hence we obtain any solution of (2) as follows: let 
z = [0, z r+1 , ...,z q ], 

where z r+v ...,z q are indeterminate over K. Any solution y of (2) 
is obtained by a suitable specialisation of the indeterminates in Qz. 
A basis for the manifold of solutions is obtained by taking 

z = [0,...,0, ( h = r+l,...,q). 

Hence the manifold of solutions has dimension q — r. 

Let us call this manifold, which consists of right-hand vectors, 
R, and let L denote the manifold of left-hand vectors defined by the 
rows of the matrix A. If 

a = (fflj, ..., af) 

is any vector of L, we can write 

p 

a i = ?,p j a ji (i = 1, ...,q). 
i 5=3 1 

If y is any vector of R, 

a x y x + ...+a q y q 

= S Pj a jiVi + ••• + ip } a ja y a 

j =1 1 


“ S S = °- 

j=l l — 1 

The two vectors a and y are said to be apolar. 

The theory of right-hand linear homogeneous equations 

£ = 0 ( 3 ) 

i «=1 

can be developed as above, and we can prove that if the matrix 

B = (b {j ) 
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is of rank t, the solutions form a left-hand vector manifold of dimen- 
sion q — t. In particular, consider the case when the columns of B 
are right-hand g-vectors forming a basis for the manifold R of 
Solutions of (2). Then 

t = q-r, 

and the solutions of (3) form a left-hand vector manifold of dimension 

q — t = r. 

But any vector of L is then a solution of (3), and it therefore follows 
that L is the manifold of solutions of (3). 

Hence the manifold L and the manifold R are in a dual relation- 
ship, each defining the other uniquely. 

We conclude this section by using the foregoing results to prove 

Theorem I. If A is a pxq matrix of rank r A , B a qxr matrix of 
rank r B , then G = AB is a p xr matrix of rank r c , where 


r A + r B-^^ r c< min ( r A> t b)- 


(i) Since B is of rank r B there exist regular matrices Q (of type 
q x q) and R (of type r xr) such that 


B — Q(I rB 0\ R- 

(4) 

o 

O 


Also, by § 4, Th. I, 


C* = GR~\ A* = AQ 

(6) 

are of ranks r c , r A respectively. We write 



A* = (A* A*), 

where A* is a p x r B matrix, and A* a px (q — r B ) matrix. Now A* 
has r A linearly independent columns, regarded as right-hand 
vectors. It follows that Af has at least 

linearly independent columns, since any set of r A columns of A 
contains at least this number of columns of A*. Hence, the rank 
of the matrix 

(Af 0) 


is at least 


rA + r B ~q- 
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But (At 0) = (A* A*)^ ra Oj 

3 ° 

= ABR- 1 , from (4), 

= CR- 1 

= G*, from (5). 

Since G* is of rank r 0 , it follows that 

r A + r B-^^ r o- 

(ii) The solutions of the equations 

Cx = 0, 

form a right-hand vector manifold R of dimension r 
any solution of the equations 

Bx = 0, 

we have Gy = A By — 0. 

Hence, any solution of (7) is a solution of (6). But the solutions of 
(7) form a vector manifold S of dimension r — r B . Since 

S^R, 

it follows that r — r — r c ; 

that is r Q ^ r B . 

If instead of considering equations (6) and (7) we considered 
the equations 

xC = 0, and xA = 0, 

we should obtain the relation 

r 0 <r A . 

Finally, then r c < min (r A , r B ), 

and the proof is completed. 

7. Matrices over a commutative field. The remainder of this 
chapter is devoted to the case in which the ground field K is com- 
mutative. Most of the results we shall obtain will be independent of 
the characteristic of the field, but some will be proved only in the 
case of a field without characteristic. Unless the contrary is stated, 
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K is assumed to be commutative, and no restriction is imposed on 
its characteristic. 

The rank of a matrix A is now equal to the rank of its transpose 
‘A' [§5, Th. II]. Hence, in particular, if A is a regular nxn matrix, 
A' is also a regular nxn matrix. An nxn matrix of rank less than 
n is said to be singular, and the terms ‘non-singular’ and ‘regular’ 
are synonymous. 

We consider how to obtain the transpose of the product AB of a 
pxq matrix A and a qxr matrix B. Let 

AB = C, 

where c tj = £ a ik b kj ( i = 1 , ...,p\ j = 1 , ...,r). 

& —1 

If C' is the transpose of G, 

c 'ij = C H 

Q 

— Z a jk t>ki 

fc -1 

- S 4 jKk 

fc-i 

Q 

— Z Kk a kj> 

= 1 

since K is commutative. Hence 

C' = B'A\ 


Now let A be a non-singular nxn matrix. A' is also non-singular, 
and therefore has an inverse 


We then have 
and hence 
But 

and therefore 
that is 


B = {A')- 1 . 

A'B = I n = BA', 

(A’ BY = I' n = 4 = (BA'Y. 
(A'Y = A, 

B'A = J n = AB', 

B' = A~\ or B=(A~ 1 Y. 


Hence, for any non-singular matrix A, 


(A')-* = (A-*)'. 


We shall usually write this matrix as A, and call it the complement 
of A. 
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We now have three methods for obtaining nxn matrices from a 
non-singular nxn matrix A. These are: forming the inverse, the 
transpose, and the complement. If we denote by S the operation of 
forming the inverse, T that of forming the transpose, and V that 
of forming the complement, we have 

TU = 8 = UT, 

US = T = SU, 

ST = U = TS, 

and S 2 = T 2 = U 2 = I, 

where I is the identity operation which leaves A unaltered. The four 
operations I, S, T, U form an Abelian group. 

Other properties of these operations which can be verified 
immediately are: 

(A + B)' = A' + B', 

(. AB )-i = B-^A- 1 , 

(. AB ) = IB, 

where A, B are nxn matrices, which need not be regular for the 
first property to hold. It is only in the last case that K must be taken 
as commutative. The first two properties hold in any field. 

8. Determinants. When the ground field K is commutative 
we are able to define determinants exactly as in the case in which K 
is the field of complex numbers. Determinants over a commutative 
field K have many properties which are similar to those of deter- 
minants defined over the field of complex numbers. It will therefore 
be sufficient to give a brief outline of the main results which we 
shall need in the sequel. 

We begin by defining certain symbols which are frequently used, 
(i) The symbol 

(ore^J 

is zero unless , . . . , i n is some derangement of the first n natural 
numbers. If the derangement is an even permutation [I, § 1], 

eh-*n = e h 1. 

If the derangement is an odd permutation, 

6<i •••*» = e*. = — 1. 
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Both notations will be used as may be convenient. In tensor theory 
there is an important difference to be observed in the uses of the 
t,wo symbols, but this difference does not concern us. 

(ii) The symbol 

<n) 

is zero unless ij, ..., i p and j x , ... ,j p are derangements of the same 
set of p distinct integers, each one of which lies between 1 and n 
(inclusive). When the derangements are of like parity, 

- 1. 

(n) 

and when they are of unlike parity, 


Evidently 


*£:::& = -i. 

(n) 

••• hi — pii ••• t» 

( 71 ) 


The (n) beneath the ^-symbol is usually omitted when there is no 
doubt about the value intended. 

Now let A = (a tj ) beannxn matrix. We define its determinant 
to be 

detvl = 


% 

a l2 

• a i7l 

®21 

a 22 

• a 2n 

®»1 

a n 2 

• a nn 

n 

n 


S- 

S eil " 

' ina i x 1 • 

*1-1 

in- 1 


.a. 


From this definition it follows at once that if B = (b i} ) is & pxq 
matrix, and A is the nxn submatrix whose elements are in the 
i x th, ...,i n th rows and^th, ... j n th columns of B, then 

det A=i i 

fcl-1 1 ( p ) 

Theorem I. If A is an nxn matrix, A' its transpose , 

det .4 = det A'. 


In the expression for det .4 consider any term 
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in which eh—** is not zero. Then i x , ...,i n is a derangement of 
defining a substitution [I, § 1] 


Z 1 2 


\*1 *2 

• »J* 

Let the inverse substitution be 


n 2 

. 71 \ 

\j i h 

• ;»/ ‘ 


Then these substitutions have the same parity, and therefore 

eh-** = eh— I*. 


If we rearrange the factors of a itl ...a inn> we obtain the equation 

a iil-" a i n n — a lj t ••• a nj n > 

and eh-*n a hl ...a inn = eh-Ua lh ...a njn . 

Conversely, any term 

eh-ina lh ...a nU 

in which the e-symbol is not zero determines uniquely a term 

eh- < na iil .,.a inn . 

Therefore 


det.4 = 


n 


E • 

W 


n 

in - 1 


a 


hi •* 


a i n n 


n n 


= 2 ... 2 eh-ina lh ...a ljn 

ii“l in = l 


n 


= 2 ••• h eh-ln a ' hl ...a' jnn = detA'. 

i 1“ 1 jn— 1 


Theorem II (Laplace's expansion ): 


«ii 

®21 

«12 

®22 

• a 2n 

ii 

a Hh 

* a iiip 

«nl 

a n 2 

• ^nn 


a iph 

®ip Jp 


^inip+i 


%+iin 

a ini n 


where j v ...,j n is any fixed derangement of l, ...,n, and the summation 
is over all possible divisions of 1, .. ., n into two sets 

< ip and ^ ^ ^ 
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(so that there are (”) terms) and = + 1 or — 1 according as i v ...,i n 
and j v ...,j n are like or unlike derangements of l,...,n. 

By rearranging the factors a {j in the expression for detJ. we 
find that 

n n 

det^l = S ••• S eil - tna i 1 i--a inn 

i,-i u - 1 

= £ - £ 

«=1 

Since = •••*», 

n n 

det = €*»•••*« 2 ... S eki '" kna k l h--- a k n U' 

lC \ *■* 1 A* n ~ X 

In the summation on the right consider the p\ x (»— p)\ terms in 
which k v ...,k p are derangements of i v ...,i p , and k p+1 , ...,k n are 
derangements of i p+1 , ...,i n . These terms sum to 


s t:± S ... S » 

*,-i fc«-i 


v l,...l n 

a hh 

• a hh 

\ 


^ip+lip+l 

• ^ip+iin 


a ipi 1 

• a ipip 


d in ip+i 

• a inin 


and hence 


det A = SPi 

i 

a hh 

• a id P 


^ip+ijp+i 

• %+ dn 


%h 

* a bb 


^inip+i 

• a inin 


This result is usually referred to as Laplace expansion in terms 
of the determinants of the j x th, ...,j p th columns. 

Using Th. I we obtain a similar expression for detJ. in terms of 
determinants in p selected rows. 

Theorem III. If two columns (rows) of a determinant are the same, 
the determinant is zero. 

We first prove the theorem for the case n = 2. By direct 
evaluation 


a a 

b b 


— ab—ab = 0. 
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Now suppose that in a determinant of n rows and columns the 
Mh and &th columns are identical. Expanding the determinant in 
terms of these two columns we have 


a U • a ln 

= 2 + 

a iih 

%k 

a itU * a i,)n 

• • • 


«<,ft 

J? 

to 

?5* 

• • • 

a nl • a nn 




a inU * a Uin 


where h,k,j a ,...,j n is a derangement of 1 , Since 

a i t k =0 
a i,ft a iik 

for all values of i v i 2 , the theorem, as applied to columns, follows. 
The result for rows follows from Th. I. 

If we expand det A in terms of the Mh column, 

n 

det .4 = 2 ( — l) i+h a ih a n . a lh _ x a lh+1 . a ln 

i *»1 

• • • • 

a i - 11 • a i-lh-l a i-lh + 1 • a i-ln 

a i + 11 • a i+lh-l ®i+l h+1 • a i+ln 

®n 1 • ft— 1 A+i • 

If we write the coefficient of a ih in this expression as A hi , we have 

n 

2 a ih A hi = det 4. 

i-i 

On the other hand, if h 4= k, 

n 

2 »ik A hi = det B, 

i = 1 

where B is the matrix obtained from A by replacing the Mh column 
by the Mh column. By Th. Ill, det B = 0, if h + k. Hence 

n 

2 a ik A hi = $hk det ^4. 

i- 1 

Expanding in terms of rows we get, similarly, the equation 

n 

2 a hi A ik = ^hk det ^4. 
i-1 

We call A u the signed minor or cofactor of a i} in det A. 
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We now prove 

Theorem IV. If A, B, C are nxn matrices, and C = AB, then 
det G = det A det B. 

if M = 


det (7 = 2 ... 2 e i t- i nc iil ...c inn 

it — 1 in — 1 , 

= 2 2 S ••• S “^inin^Jnn 

lt-1 in-1 Jt-1 Jn-1 

- S .... £ S •• - S («** S • • • ( 6 * 1 • ■ ■ ■ ■ K n) 

li“l tn = l ii«l 1 

= 2 — 2 e^-^det^( 6 i l ... 6 inn ) 

*»-l 

= det A det B. 


Theorem V. If A is any matrix with n columns, B any matrix 
with n rows, any t-rowed determinant of the matrix A B is equal to a 
sum of terms, each of which is the product of a t-rowed determinant 
of A by a t-rowed determinant of B. 

Let A = (a^) be a pxn matrix, B = (b i} ) an nxq matrix, so that 
C — (c^) = AB is a p x q matrix. 

Any <-rowed determinant D extracted from G is of the form 


D = 




that is 


D= 2 • 

r, -1 


c Uh 


• 2 c n ki • • • C T < kt 

T|— 1 


C iikt 

C itk t 


P 

• s • 

Tjt -1 


p n 

2 2 ■ 

Tt=l i, — 1 


n 

■ S *i‘; 

it - 1 




a uit bji k( 


- 2 ... 
ii-l 


S 2 • • • X ( a n Ji • • • ft) (fyi • • • bj t jc t ) 

jt-* 1 T X -1 T/-1 


n n 


- 2 - 2 

ft-i ft -1 






a Uh • a itU 


(fyi ki 
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Now 

a iih 

• a hit 

ll 

4 

a *i0i * a hot 


a iti i 

• a Uit 


a itOi • a itOt 


where the summation £ is over the 

a 

which can be chosen from the set (1, .. 


o 


sets of distinct 
.,ri). Hence 


9v 


9t 


D = 2 2 - S 


a. 


ol i-i U-i\ 


*i0i 

a Uox 


a, 


'i\ Qt 


(b jikl 


a 


Uat 


-SI 

o 


a . 


hOi 


a 


’hot 


a 


Uoi 


a , 


it Ol 


J Oi ki 


J Oikt 


J oih 


J Otk t 


We have proved then that 


%ki 


*i k% 


°iik t 

C itk t 


= s 

a 


l l 0\ 


a 


llOt 


a. 


'*(0i 


a 


itOt 


01*1 


0(fcl 


^oik, 

^Oth 


and this is the form in which we shall subsequently use this theorem. 

The values of certain determinants will be needed in the next 
section, and we state these values here. They can be obtained at 
once, expanding the determinants by rows or columns. 


(i) 


0 a 9 


0 0 


0 

0 

a„ 


— diCl* CL„ 


This determinant has non-zero elements only along the principal 
diagonal ( a i{ = a t , a i} = 0 


(ii) 


a i 

0 

• 

• 

• 

• 

• 

• 

0 

a., 

• 

• 

• 

• 

• 

• 

o • 

• 

• 

• 

• 

• 

• 

a 

• 

• 

• o 

• 

• 

• 

. 

• 

• 

a k 

• 

• 

• 

• 

• o 

• o 

• 

« 

• 

• 

• 

• 

• 

• 

• 

a n 
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Non-zero elements occur only along the principal diagonal and in 
one other place (a i{ = a it a hk = a for a particular A, A). 

(iii) If A = (a w ), where 

a ij = s h f j (1 i = = 1,...,»), 

then det .4 = e A i— 

With these preliminaries, we consider certain operations on a 
matrix A which correspond to the elementary transformations 
discussed in § 4. The operations are: 

(a) interchanging two rows of A, say the Ath and Ath rows; 

( b ) multiplying the elements of the Ath row by an element a of K\ 

(c) adding to the elements of the Ath row a linear combination 
of the corresponding elements of the other rows. 

The operation (a) is equivalent to premultiplying A by the matrix 

B = (S hii ) {i,j = 1,. ..,n), 

where (Aj, ...,A„) = (1,...,A— 1 , A, A+ 1 , — l,h,k+l, 

That is, B is the unit matrix with the Ath and Ath rows interchanged. 
By (iii) above, det B = — 1, and therefore 

If the two rows of a determinant are interchanged the sign of the 
determinant is changed. 

Operation (b) is equivalent to premultiplying A by 



where b t = 1 if i 4= A, b h = a. By (i), det B = a, and therefore 

If the elements of a row of a determinant are multiplied by a 
constant a, the determinant is multiplied by a. 

Operation (c) can be split up into a set of operations, each of 
which consists of adding to the Ath row a multiple b k of the corre- 
sponding element of the Ath row, A taking the values (1, ..., A — 1, 
h+l, ...,n). For a given A each operation is equivalent to pre- 
multiplying A by a matrix B of the type considered in (ii) above. 
B has 1 in each place on the principal diagonal, b k in the Ath row 
and Ath column, and zeros in all other places. But det B = 1, and so 
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Adding to the dements of any row of A a linear combination of the 
corresponding elements of the remaining rows does not alter the value 
of det ^4. 

All these operations may be performed on columns instead of 
rows. If a given operation on rows is carried out by premultiplying 
A by a matrix B, postmultiplying A by B' gives the corresponding 
operation for columns. Since 

det B = det B' , 

the results stated above remain true when ‘column’ is read for 
‘row’. 

We now discuss some properties of the cofactors A }i of the 
elements a i} of a determinant. If 

A = (an), 

the matrix A* = {Ay) 

is called the ad jugate of A. We saw that 

n n 

S a ik^M = S a hi A ilc = 4* det A. 

i- 1 i-1 

Hence AA* = (det A) I n = A* A. 

If, therefore, det A 4=0, the matrix 



is the inverse A~* of A. The determinant of this matrix must be 
(det A) -1 , for if A is regular, so that 

A(A-i) = 4 . 

then det A det (A. -1 ) = det 4 = 1, 

and therefore det(^4 -1 ) = (det A) -1 . 

On the other hand, if A has an inverse B, we have 

det .4 det I? = 1, and hence det A 4=0. 

We have thus proved 

Theorem VI. An nxn matrix A is non-singular if and only if 
det A is not zero. 

Let us now examine the case in which det A = 0. Then 

n 

S a u A ip = 0 ( h >P = !. 

i-1 


HPT 


6 
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Now the cofactor A jq is itself a determinant, and contains all the 
rows and columns of A except the gth row and the jth column. 
If we multiply the above equation by the cofactor of a hk in A jq , 
and sum over the following values of h: 

h= 1, ..., j- 1,0+1, 

we obtain the equation 

£**(*) Aip = °* 

where denotes the determinant obtained from A jq by re- 

placing the element a % by a u , for each value of l. Since 




- A)q> 


•M 1) 

~ — 

and otherwise 

A (i\ 



A)q \k) 

= 0, 

the equation 


© 

II 

a 


reduces to the equation 

^■jq-^-kp -A-kijAjp = 0. 

Hence any two rows (or columns) of the adjugate matrix A* are 
proportional, and we have 

Theorem VII. If detJ. = 0, and p> 1, the determinant of any 
pxp 8ubmatrix of A* is zero. 

We now prove 

Theorem VIII ( Jacobi’s Theorem ): 


A idl 

• A hi P 

II 

a jp+l ip+l 

^ ip win 

(det^)® -1 , 

A iph 

' A i p j p 


a )nip+ 1 

a ini n 



where i lt ...,i n and j v ...,j n are any derangements of the set (1, ...,n). 
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Let B be the matrix obtained from A by interchanging the ith 
and jth rows of A. If, with the usual notation, the elements of the 
adjugate matrix B* are represented by B ra , it is easily seen that 

B ra = -A rg (*=¥*, j), 

and B ri = -A rj , 

Brj ~ —A-ri- 

Assuming Jacobi’s Theorem to be true for the matrix B, we readily 
prove its truth for the matrix A, and conversely. A similar result 
is obtained by interchanging any two columns of A . Hence, if we 
prove the theorem when 

i k=jk = tc (k = 

the theorem as stated above follows. 

If p = 1 the theorem is true, by definition of A itJl . If p > 1 and 
det A = 0, it follows from Theorem VII. Assume that det A 4= 0. 
Then 


^11 


^ip 

A 


a ll 


• a m 


Api 

• 

4 

X pp * 

A 

• ^pn 


. 


• • 

• • 


0 

• 

0 1 

0 


• 


• 


0 

• 

• 

0 0 

• • 
1 


a nl 


• ^nn 



= 

det A 

0 


0 


0 



0 

det A . 


• 


• 



0 

0 

det A 


0 



a p+ii 

• • 

Q 

p+lp 


• tlp+ln 



«* i 

• • 


® np 




and therefore 


• A ip det A ^p+ip+i • (detA) p . 

• • • • • • 

A pi • App d n p+\ • 


Since det A 4= 0, the theorem follows. As a special case we have the 


result 


det A* = (det A) n_1 . 


We have already shown [Th. VI] that an n x n matrix A is non- 
singular, and therefore of rank n, if and only if det A is different 
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from zero. We now obtain further results connecting determinants 
with the rank of any matrix. If A is a p x q matrix of rank r, there 
exists at least one rxr submatrix of A which is of rank r, and any 
8X8 submatrix of A (s > r) is of rank not greater than r, and there- 
fore less than s [§5, Th. III]. It follows from Th. VI that the 
determinant of any sxs submatrix must be zero. Hence we have 

Theorem IX. A pxq matrix A is of rank r if there exists in it an 
rxr submatrix with determinant different from zero, and if every 8X8 
8ubmatrix (s > r) has zero determinant. 

If A = {a if), and 

a iih ' a hir 

a iri x • a irir 

the matrix 

/ a i il * 

\a ir i . ®i r qj 

is of rank r. Hence the vectors given by the i x th, ...,i r th rows of 
A are linearly independent. But since A is of rank r, the vectors 
determined by the rows of A form a manifold of dimension r, and 
therefore any row is linearly dependent on the i x th, ...,i r th rows. 
It follows from this reasoning that to solve the equations 

a^x^ ...+a iq x q = 0 {i=l,...,p), (1) 

we have only to solve the equations for i = i v ...,i r . To simplify 
the notation let us write 

a i P j = b p j, 

and consider the solution of the equations 

b n «!+... + b iq x q = 0 (i = 1, ...,r). (2) 

Then b xil . 6 Vr 4= 0. 

• • • 

^rix • b r j r 

By renaming the x { we may suppose that j p = p (p = 1, ..., r). Now 
let Bji be the cofactor of b if in 

6 = I 6 X1 . 6 lr I 
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If (x v ...,x q ) is any solution of the set of equations 

b il x l +... + b iq x q = 0 (i = 1, (3) 

then S S B hi b u x i = 0 = !» • ••,*•)> 

i=l j-l 

that is, 


9 


bx h + 2 b n ■ 

• b lh-l 


b l h+ 1 

b lr 

Xj = 0. (4) 

i=r+ 1 






b n 

■ K h- 1 

b r j 

b rh+ 1 

■ Kr 



Conversely, let a; r+1 , be any q — r elements of K, and let 
x v ...,x r be defined by the above equations (4). By direct sub- 
stitution we see that (x v ...,x q ) is then a solution of (3). Thus we 
see how to construct explicitly all the solutions of (1). 

Theorem X. A necessary and sufficient condition that the non - 
homogeneous equations 

9 

S a ijXj = c t (i = l,...,p) 

j =1 

have a solution is that the matrices 



have the same rank. 

Let us refer to the two matrices in question as the matrix of 
coefficients and the augmented matrix. The rank of the augmented 
matrix cannot be less than that of the matrix of coefficients. If 
the equations have a solution, the last column of the augmented 
matrix is a linear combination of the remaining columns of the 
matrix. The rank of the augmented matrix is therefore equal to 
the rank of the matrix of coefficients. On the other hand, if we are 
given the equality of the ranks, we deduce that the last column of 
the augmented matrix is linearly dependent on the first q columns, 
and if the equations 

9 

Cf = S (i=l,...,p) 

}= i 

express this linear dependence, (£ x , ..., E, q ) is a solution of the given 
equations. 

None of the results proved so far in this section requires any 
restriction on the characteristic of K. We conclude with a theorem 
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which is only true if K is not of characteristic 2. First of all, an 
nxn matrix A is said to be symmetric if 

A = A\ 

skew-symmetric if A — —A'. 


If A is skew-symmetric, 

det A = det(-A)' 

= det( — I n )detA' 
= (— l)"det A. 

Hence, if n U odd, 2 detA = 0, 


and therefore, if K is not of characteristic 2, 

det A = 0. 


On the other hand, 


A = 



is skew if K is of characteristic 2, since any element l of K satisfies 
the relation ^ ^ 


But det A = abc + a/ 2 + bg 2 + ch 2 , 

which is not zero for all a, b, c,f, g, h in K. In fact, if 
a = b = c = 1; f = g = h = 0, 
det.4 = 1. 

This brief account of determinantal theory establishes the main 
results which we shall use in later chapters. x We have shown that 
nearly all the results proved in textbooks for determinants whose 
elements are real or complex numbers extend to determinants over 
any commutative field. The many theorems on special methods of 
expansion, and on the value of determinants of special type (e.g. 
Vandermonde’s determinant) which can be found in textbooks can 
easily be deduced from the above theorems for determinants over 
any commutative field. When necessary, such special results will 
be assumed in the sequel. 

9. X-matrices. Let if be a commutative field, and let A be 
an indeterminate over K. In the following chapters we shall often 
have to consider matrices whose elements are polynomials in A, and 
it is convenient to obtain here a number of results which will 
frequently be used. 
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Strictly speaking, we have so far only defined matrices over a 
field. We can, however, deduce the basic properties of matrices 
whose elements are in an integral domain I, say if [A], simply by 
regarding these matrices as forming a subset of the set of matrices 
whose elements are in the quotient field of the integral domain. 
Addition and multiplication of matrices of the subset give further 
matrices of the subset. 

A matrix whose elements are in the integral domain A [A] is 
called a A -matrix. An nxn A-matrix which has an inverse which is 
also a A-matrix is called a regular A-matrix, of order n. 

Theorem I. A necessary and sufficient condition for the nxn 
A -matrix M to be a regular X-matrix is that det M is a non-zero 
dement of K. 

(i) Suppose that M is regular, and let N be its inverse. Since 
the elements of M and N are polynomials in A, the determinants of 
these matrices are polynomials in A. Let 

det M = /(A), detN = g(X), 
where /(A) is of degree r, g(X) of degree s. Then since 

MN = 

det M detiV = 1, 
that is, /(A)gr(A) = 1. 

But one side of this equation is a polynomial of degree r + s, while 
the other side is a polynomial of degree 0. Hence 

r — s — 0, 

and /(A), g( A) are non-zero elements of K. The condition is therefore 
necessary. 

(ii) Suppose that det AT = m, where to is a non-zero element 
of K. Let M = {m i} ), and write 


n tj = ( — ) {+} to -1 

TOu 

• m i i-i 

m H+ 1 

. TO ln 


ii 

• m j- i i- i 

m i- 1 i+ 1 



%n 

• m i+ 1 <- i 

m j+ 1 l+l 

• m j+l n 


™ n i 

• m n i-1 

m nl+ 1 

• n 


Then N = (n {j ) is a A-matrix, and [§ 8, Th. VI] 

MN = I n . 
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We now define equivalent A-matrices. Two pxq A-matrices A 
and B are said to be equivalent if there exist regular A-matrices M 
and N, of orders p and q respectively, such that 

A = MBN. (1) 


This relation is reflexive, since 


From (1) we have 


A ipAIq. 

B = M-'AN 


and M~ x , N -1 are regular A-matrices, since M, N are regular. The 
relation is therefore symmetric. Again, if besides (1) we also have 

B = PCQ, 

where P and Q are regular, then 

A = (MP) G(QN), 

where MP, QN are regular A-matrices. The relation is therefore 
transitive, and it is thus an equivalence relation. 

We wish to find a set of canonical forms for all p x q A-matrices. 
As in § 4 we say that a set of p x q A-matrices is a set of canonical 
forms for the set of allpxq A-matrices if 

(а) no two matrices of the set are equivalent; 

(б) there is a matrix of the set equivalent to any given pxq 
A-matrix. 

Clearly, if we can construct such a canonical set, and also find 
a method for determining the canonical form equivalent to any 
pxq A-matrix, we shall have a method for determining whether 
two given A-matrices are equivalent. 

Our first step is to consider certain simple operations on a A- 
matrix A. These are known as elementary transformations [cf. §4], 
and produce a A-matrix B equivalent to A. 

(i) Let k lf ..., Jc p be any derangement of the integers 1, ...,p, and 
let P be the pxp matrix 

P = (fik(i) (i,j =l,...,p). 

Then detP = ± 1, and the matrix B = PA is the matrix obtained 
from A by performing the permutation 

/I 2 . p\ 

\&1 ' kp) 

on the rows. Hence, the elementary transformation of A which 
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consists of a permutation of the rows of A transforms A into an 
equivalent matrix. 

(ii) A permutation of the columns of A transforms A into an 
equivalent matrix. The proof is similar to that in (i), the new matrix 
being of the form AQ. 

(iii) Let R be the pxp matrix (r i} ) where 

r ij = 

r h j=fM) (j = l,...,h-l,h+l,...,p), 

where /^(A) is a polynomial in A [A], and 

r hh = A(A) = a (a in K, and not zero). 

Then det R = a, and R is therefore a regular A-matrix. We obtain 
a matrix B = RA equivalent to A by multiplying the 6th row by 
a and adding to it/ x (A) times the first row, / 2 ( A) times the second 
row, // t _i(A) times the ( h — l)th row, / a+ 1 (A) times the (h+ l)th 
row, . . ., f p ( A) times the pth row. 

(iv) If a column of A is multiplied by a non-zero element of K, 
and to it we add a multiple (by a polynomial) of any other column, 
we obtain a matrix, equivalent to A, of the form AS. The proof is 
similar to that in (iii). 

We now show how a matrix can be simplified by a series of 
elementary transformations until it is reduced to a canonical form. 

Let A be a p x q A-matrix. If it is the zero matrix, any matrix 
equivalent to it is clearly zero, and no reduction is possible. We 
therefore assume that at least one element of A is not zero, and by 
a permutation of rows and columns we bring a non-zero element of 
A to the top left-hand corner. Let this element a n be of degree p 
in A. We show that unless o u , regarded as a polynomial in A, is a 
factor of every element a i} of A, we can transform A into an equi- 
valent matrix C, where c n is not zero and is of lower degree than o u . 
Our transformation will be a series of elementary transformations. 

(i) If, for som ej, a u is not a factor of a 1} , then 

a u = a u 6(A) + c(A), 

by the division algorithm, where c(A) is not zero, and is of lower 
degree than a u . From the jth column we subtract 6(A) times the 
first column, and then interchange the first and jth columns. The 
resulting matrix is equivalent to A, and has c(A) in the top left-hand 
comer. 
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(ii) If, for some i, a u is not a factor of a n , we operate on rows 
instead of columns. 

(iii) Suppose that a u is a factor of all non-zero elements in the 
first row and column, but is not a factor of a hk (h, k > 1). Let 
a a = b { a n , and a 1} = c } a lv Then if we subtract from the ith row 
b t times the first row, for each *> 1, and then subtract from the 
jth column c, i times the first column, for each j > 1, we obtain a 
matrix B equivalent to A in which 

6 U = <*u, = 0 (j> 1), b tl = 0 (i> 1), 

while b hk is still not divisible by 6 U . Now add the feth row of B to 
the first row, and apply method (i). We obtain a matrix G equivalent 
to A, with c u 4= 0, and of lower degree than a u . 

Since there is a finite upper bound to the degree in A of the 
elements of the matrix A, the processes described above cannot be 
repeated more than a finite number of times. We eventually obtain 
a matrix B, say, equivalent to A, in which 6 n is a factor of every 
element b i} . By (iii) we can arrange that 

bn = 0 (i>l), b lj = 0(j>l). 

Thus a series of elementary transformations has produced from A 
the equivalent matrix 

PAQ= /b u 0 0 

I 0 622 « 

0 : 

\ 0 b p 2 . 



where b u is a factor of all the elements b i} . 
If the submatrix 


B t = lb 


22 




is not zero, we can repeat the process given above, until we obtain 
an equivalent matrix 


P 1 B 1 Q 1 — /c 22 0\ 

\o cj' 


each element of C 2 containing c 28 as a factor. Then 


/I 0\ PAQ/l 

°\ = 

- (b n 

0 

°\ 

O 

O 

qj 

° 

C 22 

0 



\o 

0 

cj 
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so that the elementary transformations on the submatrix B x can 
be considered as elementary transformations on A. Proceeding in 
this way we can transform A into an equivalent matrix in which 
the only non-zero elements are on the principal diagonal, and each 
element on the diagonal is a factor of all those following it. Finally, 
multiplying each row of this matrix by a non-zero constant, A is 
equivalent to the matrix 


MAN = 


IE i(A) 0 . 

0 

• °\ 

/ 0 E 2 (, A) . 

0 

0 

: : 

4(A) 


\ : : : 

• 

. 0 / 


( 2 ) 


where each E t ( A) is a polynomial in A with + 1 as the coefficient of 
the highest power of A, and E t {. A) is a factor of E i+1 ( A). Since equi- 
valent A-matrices have the same rank, and the rank of the matrix 
MAN is evidently r, it follows that r is the rank of A. 

Again, writing MAN = E = (%) , 

an evident extension of § 8, Th. V gives us the theorem 


e hh * e hU 

= 22 
A n 

* m *i A, 


a x l m • a \ 1 nt 


n m h • n />i it 

^ith ‘ 


Ai * 'W'it A t 


a A t ii X • a x t m 


n Hfi • n nit 


It follows that if D t ( A) is the highest common factor of the deter- 
minants of order t formed from A , D t (\) is also a factor of each deter- 
minant of order t extracted from E. But since M, N are regular, 
the equation MAN = E 


can be written as 


A = M-'EN- 1 , 


and therefore the highest common factor of determinants of order 
t formed from E divides each determinant of order t formed from A. 
Hence, save for a factor in K, this highest common factor is D t ( A). 
But from the properties of the polynomials E t ( A), 


D t (\) = E 1 (\)E 2 (\)...E t (\), 
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provided that the coefficient of the highest power of A in D t ( A) is 
taken to be + 1. Therefore 

W) = A(A)/A-!(A) (i = 1 r), 

defining D 0 (A) = 1. 

This last result shows us that the Ef A) are uniquely defined by means 
of the highest common factors of the determinants of different 
orders extracted from A. Calling the E { ( A) the invariant factors of A 
we have now proved 

Theorem II. If Aisapxq X-matrix of rank r, and E 1 (X), . . . , E r ( A) 
are its invariant factors, there exist regular X-matrices M, N such that 


MAN = 


/ ^l(^) 

0 

0 

• °\ 

0 

E 2 (X) 

0 


: 

• 

. E r (X) 

i 1 

\ : 

. 

• • 

• • 

. 0 / 


Since the inverse of a regular A-matrix is a regular A-matrix we 
also have 

Theorem III. Two p x q X-matrices A and B are equivalent if and 
only if they have the same invariant factors. 

Hence the set of matrices (2) is a set of canonical forms for 
pxq A-matrices. 

If the field K is such that every polynomial /(A) in if [A] can be 
written as a product of linear factors, it is sometimes convenient 
to use elementary divisors in place of invariant factors. We define 
these as follows. Let 


EfX) = (A — oq)®!! (A — a a )®i2 ... (A-a 8 ) e «, 
E 2 (X) = (A-cq)®« (A-a 2 ) e 2 * ... (A-a g )«a», 


E r { A) = ( A - oq)«n (A - a 2 ) e n ... (A - a s )®«, 

where each e w >0, and A — cq, ...,X — cc s are the distinct linear 
factors of E r ( A), that is, e rj > 0 (j = 1 , ..., s). Then those factors 

(A-oq)®!!, (A — oq)«*i, ..., (A-a 2 )«»«, .... (A-a 8 )®w, 

for which the indices e ti are greater than zero are called the 
elementary divisors of A. By definition, these are determined by 
the invariant factors. 
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Suppose, conversely, that we are given the elementary divisors 
of A, which is known to be of rank r. Let them be 


(A-a)°», (A-a)°s, ..., (A-a)°< {a x ^a 2 ^ ... >a,), 
(A — (A — ...» (b x ^b 2 ^ ...), 


(A — 8) d ', (A — 8) d *, ..., (d x ...)• 

Each invariant factor Ef A) must be a product of these factors. 
Since E r (X) contains EfX) as a factor (i < r), E r (X) must contain the 
highest power of A — a, the highest power of A — /?, etc. Hence 

E r ( A) = (A - a)“i (A - pfi ... (A - S) d K 

Similarly, since E r _ x {X) is divisible by E t ( X) (i<r— 1), 

E r - x (X) = (X-a)°*(X-/1p... (A -S)\ 

and so on. Once all the elementary divisors which are powers of 
A — a are used up, the remaining E^X) do not contain A — a as a 

factor. Similarly for A—/? When all the elementary divisors 

have been used up, the remaining invariant factors (if any) are 
Ef X) = 1. An equivalent form of Th. Ill, is therefore 

Theorem IV. A necessary and sufficient condition that two pxq 
X-matrices be equivalent is that they have the same rank and the same 
elementary divisors. 

Our next theorem is often useful in subsequent chapters. 
Theorem V. If A, B are two nxn X-matrices which can be written 

A = A x X — A 2 , B = B x X—B 2 , 


where ^4^ A 2 ,B X , B 2 are nxn matrices over K, and B x is non-singular, 
then the equivalence of A and B implies the existence of non-singular 
matrices P, Q over K such that 

PAQ = B. 


Since A and B are equivalent, there exist regular A-matrices 
M, N such that MAN = R 


Let 


X = B 2 B X \ 


Then if M = M 0 + M x X+ ... + M r X r , 

where the M t are matrices over K, and if 
p = J^A'- 1 + (M r _ x + XM r ) A'- 2 + . . . + (M x + XM 2 + . . . + X'-'M r ), 
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and P = M 0 +XM 1 + ...+X'M r , 

then M = (AJ n - X) p + P = (U n - J5 a Pf 1 ) 5, B^p + P, 
that is M = BP 1 + P, 

where P 1 = B^p. 

Similarly, let Y = 2?f 1 B z , 

and write 

N = N 0 +N 1 \+...+N a \', 

q = N a A«-i + (N^ + N S Y) A* -2 +... + (N 1 + N 2 Y + ...+N s F*-*), 

q = n 0 +n 1 y+...+n 8 y\ 

Then we have 

N = q(M n -Y) + Q 
= QlB+Qy 

where Q 1 = qB f 1 . 

Now PAQ-B = (M-BP 1 )A(N-Q 1 B)-B 

= MAN -MAQ^-BP^N + BP X AQ X B-B 
= BP 1 AQ 1 B- BN~ 1 Q 1 B - BP X M~'B 

(since MAN = B). Thus PAQ — B — BRB, where 

B = P X AQ 1 —N- 1 Q 1 —P 1 M~ 1 , 

and since M and N are regular A-matrices, R is a A-matrix. Suppose 
that R is not zero, say 

R — Rq + i?jA+ ... •)■ R( A* (.Bj =i= 0, f ^ 0). 

Then, from the above, 

PAQ — B = BRB, 

where the left-hand side is of degree 1 at most in A, whilst the right- 
hand side is of degree 

t- {-2^2. 

This contradiction shows that R = 0, and therefore 

PAQ = B. 

Since P, Q are independent of A, 

PA 1 Q = B 1 , PA 2 Q = B t . 

Since B 1 is not singular, P, Q are not singular, and the theorem is 
proved. 
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10. Miscellaneous theorems. We conclude this chapter with 
a number of results which will be needed later. Some of them 
require the use of theorems which are more appropriately dealt 
with in later chapters, and it will avoid repetition if we borrow 
these theorems without proof at the present stage. 

If A is any nxn matrix over K, the A-matrix 

A-XI n 

is called the characteristic matrix of A, and its determinant 

/(A) = det (A — AI n ) 

is called the characteristic function of A. The roots of the character- 
istic equation .... A 

J\ A ) - v 

are called the latent roots of A. Let us write 

/(A) = a 0 + a 1 A+ ... +a n \ n . 

Form the adjugate matrix of A — A I n . It is a A-matrix whose elements 
are at most of degree n— 1. We may therefore write it as 

B 0 + B x A + . . . + B n-1 A n_1 , 

where the B i are nxn matrices over K. Remembering that 
{A - AJJ adjugate (A - A/J = det (A - A I n ) I n , 
we have the equation 

(A - AJJ (R 0 + B X A+ ... + B n _ t A-i) =/(A)/ n . 

Equating like powers of A on both sides of this equation, we have 

ABq — 

AB X — B 0 = a 1 I n , 

AB% — = o 2 / n , 

Bn-i = 

Premultiply these equations by I n , A, A 2 , ...,A n respectively, and 
add. We obtain the matrix equation 

0 = a 0 I n +a 1 A+a 2 A 2 +...+a n A n . 

This result is usually stated in the form 

Theorem I. A square matrix satisfies its own characteristic 
equation. 
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We note that since the characteristic function 

/(A) = a 0 + a 1 A + ... +a n X n = det(A- A I n ), 

/( 0) = a 0 — det A. 

Hence A = 0 is a root of the characteristic equation if and only if 
A is singular. 

We next introduce the idea of the square root of a matrix. 
Assuming the ground field K to be not only commutative but 
algebraically closed, so that every polynomial (j>{x) in K[x] is 
expressible as the product of linear factors, we shall prove the 
following theorem in the next chapter [III, § 8, Th. II]: 

If f)(x) is any polynomial in K[x\ such that f>(0) 4= 0, then there exists 
a polynomial g(x) such that [g{x)] r — x is divisible by <f>(x), r being any 
given positive integer. 

We use the case r = 2 of this theorem and take (f(x) to be the 
characteristic function f(x) of a non-singular matrix A. Then we 
can find g(x) such that 

[g{x)] 2 -x =f(x)h(x). 

Since /(A) = 0, 

[g(A)] 2 -A = 0 , 

that is, [gr(A)] 2 = A. 

Hence we have 

Theorem II. If A is a non-singular matrix over an algebraically 
closed field K, there exists a matrix X such that 

X 2 = A. 

Finally, we introduce the idea of the minimum function of a 
matrix A. If 

f(A) = A” l +a 1 A” t - 1 +...-|-a m 
is the polynomial of least degree for which 

rJr(A) = 0 , 

we call ijr( A) the minimum function of the matrix A. Let the in- 
variant factors of A — XI n which are of degree > 0 be 


-®8+lW> •••» 
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We shall see in VIII, § 2, that there exists a matrix B, where 



such that 

(а) B = P^AP for some non-singular P over K, 

(б) det(J3*-AZ) = ±E t { A) (» = «+l, 

In fact, we need only take 



where 

By Theorem I, 


-2^(A) = + 1 + . . . + ci{ rt . 

tfiW = 0. 


We observe that E { (X) is the minimum function for the matrix B { . 
To see this, we first note that when r < r { the rth power of B i has 


the form 


(BiY = 


(* u 

\B C )’ 


where A is the zero (r i — r) x r matrix, I ri _ r is the unit matrix of 
r i — r rows and columns, and B and C are, respectively, rxr and 
rx(r < — r) matrices whose elements are polynomials in the a i} . 

Now let /(A) = b 0 + b 1 A+...+b s \° (b s =# 0) 

be a polynomial of degree s over K , where s<r { . Then it follows that 
f(Bi) is a x r f matrix having the (s + l)th element in the first row 
equal to b s . Hence f(B t ) cannot be zero. 

Again, if g( A) is any polynomial over K such that g(B { ) = 0, 

we can write gr(A) = a( A) E t ( A) + 6(A), 

where 6(A) is of degree less than r t . Then 


b(Bi) = g{B t ) —a{B t ) E^Bj) 

= 0, 

and by the result just proved it follows that 6(A) — 0. Thus g{ A) 
must have E t ( A) as a factor. 


HPI 


7 
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Now suppose that F( A) is any polynomial over K such that 
F{A) = 0. We have 

F(B) = /F(B a+1 ) 0 ° \ 

I 0 F(B s+2 ) . 0 \ 

\ 0 . F(B n )/ 

= F(P~ 1 AP) = P-'F^P = 0, 
and hence F{B i ) = 0 (* = *+ 1, 

Thus F( A) must have E s+1 (A),E 8+2 (\), ...,E n (X) as factors, and 
conversely, if F(\) contains these polynomials as factors, F(B) = 0. 
Since E^ A) is a factor of Ej(\) for i <j we obtain 

Theorem III. If A is an nxn matrix over K, and A — A I n has 
the invariant factors E^ A), ..., E n { A), E n { A) is the minimum function 
of A, and a necessary and sufficient condition that a polynomial f (A) 
over K be such that f {A) is zero is thatf(A) has E n ( A) as a factor. 
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CHAPTER HI 

ALGEBRAIC DEPENDENCE 

In this chapter it is assumed that all fields are commutative. 

1 . Simple algebraic extensions. In I , § 9 , we discussed a method 
of extending a commutative field K by the adjunction of an in- 
determinate x. We now describe another method by which we can 
extend a commutative field K. 

Let f(x) be an irreducible polynomial of degree n in K\x\. We use 
f(x) to define an equivalence relation in K[x~\. 

Two polynomials a(x) and b(x) of K\x\ are to be equivalent if 

a(x)-b(x) = c(x)f(x), 

where c(x) is in K[x]. This relation is clearly reflexive, symmetric, 
and transitive, and it is therefore a proper equivalence relation 
[I, §2]. The polynomials of K[x] are divided into classes by this 
relation. We denote a class of equivalent polynomials by {a(x)}, 
where a(x) is any member of the class. 

If a(x), a'(x) are equivalent polynomials, and b(x), b'(x) are also 
equivalent, 

a'(x) = a(x) + c(x)f(x), 
and b'(x) = b(x) + d(x)f(x). 

Hence a'(x) + b'(x) = a(x) + b(x) + [c(x) + d(x)]f(x), 

and a'(x)b'(x ) = a(x)b(x) + g(x)f(x), 

where g{x) = a(x)d(x) + b(x)c(x) + c(x)d{x)f(x). 

Hence we may define addition of classes of equivalent polynomials 
unambiguously by the equation 

{«(*)} + {&(z)} = {a{x) + b (#)}, 

and multiplication by the equation 

(a(x)}{6(x)} = {a(x)6(x)}. 

It may be verified without difficulty that the classes form an Abelian 
group under addition, and that multiplication is associative, com- 
mutative, and distributive over addition. Hence the classes form 
a commutative ring with the given laws of composition. The zero 

7-2 
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of the ring is the class {0}, that is, the class of polynomials divisible 
byf(x). There is also a unity, represented by {1}. 

Now let (gr(x)} be any non-zero class. Then g(x) is not divisible by 
f(x). Since f(x) is irreducible it follows that the highest common 
factor of /(a;) and g(x) lies in K, and therefore polynomials a(x), 
b(x) can be found so that 

a{x)g(x) + b(x)f(x) = 1. 

We deduce that 

{«(*)} {^(x)} = {1}. 

Thus every non-zero element of the ring of equivalent classes of 
polynomials has an inverse. That is, the ring is a field, which we 
denote by K'. 

If a, fi are any elements of K, 

{«} - m 

if and only if a — ft is divisible by/(x); that is, if and only if a = /?. 
Hence the classes {a} form a subfield of K' isomorphic with K. By 
I, § 2, Th. I, we deduce the existence of an extension K* of K which 
is isomorphic with K' . The field K* is uniquely defined to within 
an isomorphism in which the elements of K are self-corresponding. 

Let £ denote the element of K* which corresponds to {x} in K' . 
Then, if g(x) is any polynomial of K[x], g{E) corresponds to {g{x)}. 
By the division algorithm we can write 

g(x) = h(x)f(x) + g l (x), 

where g t (x) is a polynomial of degree less than n. If g(x) itself is of 
degree less than n, we take h(x) = 0, g(x) = g x (x). Since 

{?(*)} = 

we have g{£) = g x {£). 

It follows that any element rj of K* can be written in the form 

if = a 0 + a 1 g+...+a n _ 1 ^ n - 1 (a f inZ), (1) 

and, of course, the right-hand side of this equation, for any 
a 0 , . . . , a„_ x in K, represents an element of K*. Now the element 
/(|) of K* corresponds to the element {/(x)} of K'. Since 

{/(*)} = { 0 }. 

M) = 0 . 
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Let F{x) be any polynomial over K such that 

m = o. 

Then {f’(x)} = {0}, 

and hence F(x) is divisible by f(x). F(x) is therefore zero or of 
degree n at least, and it follows that the representation (1) of the 
elements of K* is unique. Equation (1) thus gives a very convenient 
means of representing the elements of K*. Using this representation 
we have 

( a o + a x £ + . . . + a n _! I” -1 ) + (/? 0 + £ + • • • + ^ w _i £ n-1 ) 

— (a 0 + /?„) + (a x + /? x ) £ + . . . + (a n _i + ftn-i) £ n_1 * 

To construct the product we have 

(a 0 + a 1 £+ ... + a n _ t ) (/?„ + A £ ■ + . . . + /?„_ x i n ~ x ) 

= «0 /?0 + («oA + «1 A») S + — + <*n-lPn-lZ 2n ~ 2 > 

and we then use the relations 

o = £ r M) = To^+Vi £ r+1 + ...+y*£ n+r ( r = »-2,n-3,...,0) 

to express £ 2 ™- 2 in terms of £"~ 2 , ...,£ 2n_3 , and then £ 2n “ 3 in terms 
of £ n ~ 3 , . . . , £ 2 " -4 , and finally to eliminate powers of £ of degree higher 
than n — 1 from the product. 

A simple example will serve to show that for some fields K a 
true extension is constructed in this way. If K is the field of rational 
numbers, and /(*).*» -2, 

K* is the field of numbers 

Cl + b y]2, 

where a and b are rational. K* is not equal to K. If, on the other 
hand, K is the field of complex numbers, every polynomial of degree 
greater than one is reducible, and no proper extension of K can be 
constructed. 

The method of extension described above is called simple algebraic 
extension. 

2. Extensions of a commutative field. We now consider some 
general properties of extensions of a commutative field K, first 
defining equivalent extensions. Extensions K v K 2 , ...of K are said to 
be equivalent if they are isomorphic in such a way that those elements 

of K v K 2 , ... which lie in K are self-corresponding. 
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Let K* be any extension of K, and let £ be any element of K*. 
We say that £ is algebraic over K if there exists a non-zero poly- 
nomial f(x) in K[x\ such that 

m = o. 


If £ is not algebraic, it is transcendental over K. 

Clearly, any element a of K is algebraic over K; for we merely 

have to take ,, . 

f(x) = x — a. 


Theorem I. If £ is algebraic over K, there exists an irreducible 
polynomial f(x) in K\x\, unique save for a non-zero factor in K, 
such that 



Since £ is algebraic over K there exists a non-zero polynomial 


g(x) in K\x\ such that 


g{i) = o. 


Let g{x) = g l {x)g 2 {x) ...g r {x) 

represent the factorisation of g(x) into irreducible factors. Then 
gM) is in K*, and 

gi(£)g2{£)-g r (£) = g(Z) = °- 

Since K* is a field, and therefore has no divisors of zero, for some 
integer i (1 < * < r) we have 

gM) = 0 . 

Take f(x) = gfx). 

If h(x) is any irreducible polynomial such that 

H£) = o, 

then either h{x) = af(x) (a in K), 

or h(x) and f(x) are relatively prime. In the latter case there exist 
polynomials a(x) and b(x) such that 

a(x)f(x) + b(x)h(x) = 1. 

But this gives us the equation 

a(£)f(£) + b(£)h(£) = 1, 

which, since the left-hand side is zero, is a contradiction. Hence 
f(x) is uniquely defined, save for a non-zero factor in K. It is called 
the characteristic polynomial of £ with respect to K. 
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Now let £ denote any element of an extension K* of K. We con- 
sider all the elements of K* which can be obtained by performing 
the operations of addition, subtraction and multiplication (a finite 
number of times) on £ and the elements of K. Such elements can 

be written ... . . T ,. 

<*o + a ife + ••• + a r £ (a f m K). (1) 


Clearly these elements form a commutative ring with unity, the 
unity being that of K. If y, £ are two elements of this ring, the 
product ?/£ is equal to the product of these elements regarded as 
elements of K*, and therefore j/£ = 0 only if either 9 / or £ is zero. 
The ring is therefore an integral domain. We denote it by !£[£], 
noting that A[£] — K if and only if £ is in K. Since !?[£] is an 
integral domain, it can be embedded in a field [I, § 4]. We denote 
this quotient field by Jl(£), and refer to it as the field obtained by 
adjoining E,to K. 

Case (i). Let £ be algebraic over K , and let/(») be its characteristic 
polynomial, of degree n. If 


we can write 


0(5) = a o + a i£+ ... +a r £ r , 
g{x) = h(x)f(x)+g 1 (x), 


where g x (x) is of degree less than n. Then 


0(5) = 0i(5). 


and hence every element of K[£j\ can be expressed in the form (1), 
where r < n— 1 . This expression is unique, for if any element could 
be so expressed in two ways, there would exist a non-zero poly- 
nomial g(x) of degree less than n such that 

0(5) = o, 

and this would contradict Th. I. 

It follows that the integral domain !£[£] is equivalent to the 
simple algebraic extension of K defined, as in §1, by f(x). Hence, 
in particular, K[Ej\ is a field, and 

m = m. 


As an important corollary we have 

Theorem II. If £ x and £ a are two elements of an extension of K 
which are algebraic over K and have the same characteristic poly- 
nomial, then K(^j) and A(£ a ) are equivalent extensions of K. 
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Case (ii). Now suppose that | is transcendental over K. Then 
a o+ a i £+... + a r £ r = fi 9 +fii £+...+ f} s £* (a r ,^,4=0), 

if and only if r = s and a t - — /3 { (i = 0, It follows that If[£] is 

equivalent to the integral domain K[x\, and therefore that if(£) is 
equivalent to the field of rational functions of the indeterminate x 
over K. The field Jf(f) is called a transcendental extension of K. 

In either case it is clear that K{E) is the smallest extension of K 
which contains £. There may, of course, be extensions of K which 
are proper subfields of K{E), but these will not contain £. For 
instance, if £ is transcendental over K, 

*<=*(£»)<=*(!). 

The ideas introduced above can be extended by considering the 
smallest extension of K which contains a finite set £ x , ...,£ n of 
elements of K*. The elements of this extension are those elements 
of K* which are obtained by applying the operations of addition, 
subtraction, multiplication and division (except by zero) to £ x , ..., £ n 
and to the elements of K. The resulting field is denoted by 
if(£ x ,...,£J, and evidently does not depend on the order of the 
elements g x , ..., £ n . We can define the field K(£ v ...,£„) inductively 
as follows: K(£ v ...,£ r ) is the field obtained by adjoining £ r to 
K{i i, ...,£ r -i) (r- 2,. 

Some of the may be algebraic over K, and we shall usually 
find it convenient to order the in the set £ x , ...,£„ to satisfy 
certain conditions. If each £ t - is algebraic, there is no restriction on 
the order. If some of the are transcendental over K, we take one 
of them as £ x . If £ ia , ..., £ if are transcendental over K^) (and hence 
transcendental over K), we take one as g 2 , an d continue thus. 
Eventually we can say that in If(£ x , ..., £ n ) the element £ x is trans- 
cendental over if, the element^ is transcendental over K{£ v ..., £ g _ x ) 
(s = 2, and Uv-’Zn are algebraic over K(g v ..., £,.). 

With this arrangement, if(£ x , ...,£,) is equivalent to the field of 
rational functions of x v . . . , x r with coefficients in K [I, § 9]. We have 
proved this result when r — 1, and we now assume it is true for 
r = 8 — 1. Then If(£ x , is equivalent to the field of rational 

functions K(x v ...,a; s _ x ). Now 

and since £ s is transcendental over 2f(£ x , ..., £ s _i), K (£ x , ..., £ g _ x ) (£ g ) 
is equivalent to if(£ x , ..., £ g _ x ) (a; g ), and hence to 

K(xi, ...,# g _ x ) ( x a ) = K{x±, ...,x a ). 



2. EXTENSIONS OF A COMMUTATIVE FIELD 105 

The elements § v ...,£ r are said to be independent indeterminates 
over K. 

In this book we shall only be concerned with the extensions of a 
field K obtained by adjoining a finite number of elements ...,£ n 
to K. 


3. Extensions of finite degree. An extension K* of A is said 

to be of finite degree if there exists a finite number of elements 

§ x , £ n of K* such that any element y of K* is expressible in the 

form r , , r 

V = «i5i + — +a»6»» 

the a v ...,a n lying in K. The elements £ x , ...,£ n are said to form 
a basis for K* over K. This basis is a minimal basis if the equation 


<*l£l + ••• +<*n£n = 0 

implies that — 0 (i = 1, n). 

The elements of K* clearly form a linear set 

over K [II, § 1], and therefore if K* is of finite degree over K we 
deduce the existence of a minimal basis, the number of elements 
in the basis being the same for all minimal bases [II, § 1, Th. VI]. 
This number is called the degree of the extension K* of K. 

Theorem I. If K* is an extension of K of finite degree n, every 
element of K* is algebraic over K, and has a characteristic polynomial 
of degree not greater than n. 

Let £ v ..., £ n be a minimal basis for K* over K. Then, if y is any 
element of K*, rj £ { is an element of K* and 


V£i = a <l £i+ — + <Xin£n ( i= L •••>»). 


The linear equations 

( a n — y)x 1 + ...+ oc ln x n = 0, 


oi nl Xi+... + (a nn -y)x n = 0, 

have a solution in K*, namely, (£ 1; ..., £ n ), which is not trivial. 
Hence 


a u- V <*i2 


<*i* 


= 0 . 


&21 <*22 V • <*2n 

<*»i “na 


• <*nn“7 
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The polynomial 

m- 


■XI — X 

a ia 


«21 

a 22 -a; 

• a 2n 

• 

• 

• • 

<*» 1 

a rt2 

7171 


is not zero, the term of highest degree being ( — l) n x n . Since rj 
satisfies the equation . 

F(v) = 0 , 

7] is algebraic over K. Moreover, its characteristic polynomial is a 
factor of F(x); hence its degree cannot exceed n. 

An extension K* of K in which every element is algebraic over K 
is called an algebraic extension oi K. We have proved that every 
extension of K of finite degree is algebraic. We now prove 

Theorem II. If the elements £ x , ..., of an extension K* of K are 
algebraic over K, then A(£ x , ...,£„) is an algebraic extension of K. 

We first prove that any element ij of A(£ x , ..., £ n ) can be expressed 
in the form x x 

V='l> ••• *E 

i,=0 i.-O 

the coefficients in lying in K, and r x , . . . , r n being integers which 
are independent of the element y. We prove this result by 
induction, assuming the truth of our statement for J5 l(£ x , ..., £ re _ x ). 
Since £ n is algebraic over K, it is also algebraic over A(£ x , ...» £ n -i)- 
Let r n be the degree of its characteristic polynomial with respect 
to this field. Then, since K(£ v ..., £ n ) is obtained by adjoining £ n to 
JT(£ X> x), any element y of K(£ v ...,£„) can be written in the 

form y = e 0 +G 1 £n+-+o r ^ 1 &-\ 

the Q i lying in K(fi v But by the hypothesis of induction 

we can write 


r,-l r»_i-l 


~ E ••• S (® ~ • • •> r n 1)» 

<,=0 0 

where the integers r x , ..., r n _ x are independent of and hence 
of 7 ], and the coefficients ot ii in ii lie in K. Hence 

ft— 1 r»-l 

v = S — S <*<,.„<,£ 5* — SSr. 

i,-0 in-o 

This proof is valid when n = 1 and therefore our result is proved. 



3. EXTENSIONS OF FINITE DEGREE 


107 


It follows that the r 1 r 2 ...r n elements 

(h = 0, 1; i n = 0,...,r n -l) 

form a basis for #(£ 1( ..., £ n ). Hence, by Th. I, any element of 
#(£ x , ...,£ n )is algebraic over #. By definition, therefore, #(£ x , ...,£„) 
is an algebraic extension of #, and the theorem is proved. 

Suppose now that of the elements £ x , ...,£ n the first r, £ 1# ...,£ r , 
are independent indeterminates over #, but that £ r+1 , . . . , £ n are alge- 
braically dependent on*# (£ lf ...,£ r ). Then K(fi v is an algebraic 

extension of #(£ x , ...,£,.). It is called an algebraic function field, and 
its elements are algebraic functions of the indeterminates &.-.&• 

Theorem III. If K* is any extension of #, the elements of K* 
which are algebraic over K form afield #', where 

K^K'^K*. 

If £ and r\ are elements of #* which are algebraic over # so is 
every element of #(£,?/), by the previous theorem. Hence, the 
elements £ — 17 an d £>/, lying in #(£,?/), are algebraic over #. It 
follows that the elements of K* which are algebraic over # form 
an integral domain which contains # as a subring. Further, if £ 
is algebraic over #, but not zero, £ _1 lies in #(£) and is therefore 
algebraic over #. Hence the set of algebraic elements of #* form 
afield#'. Clearly #c#'c#*. 

Theorem IV. If K* is any extension of #, and #' a subfield of K* 
whose elements are algebraic over #, then any element of K* which is 
algebraic over #' is also algebraic over #. 

We do not assume that #' contains #. Let £ be any element of 
#* which is algebraic over #', and let 

x n + 7j 1 x n ~ 1 + ...+r/ n ( ijiinK ') 

be its characteristic polynomial with respect to #'. Consider the 
field K(rj v ...,?/„,£). Any element can be written as 

S (£< in K(Vi> -••>Vn))- 

i-X 

But for each i we can write 


b< = s • 

i,-l 


6 - 


•/s 4. 

U - 1 


. 4" 1 


Vn' 


in#), 


and therefore #(x/ 1( ...,x/ n , £) has a finite basis over #, and hence, 
by Th. I, it is algebraic. It follows that £ is algebraic over #. 



108 


in. ALGEBRAIC DEPENDENCE 


We now introduce the notion of algebraic dependence. A set of 
elements ..., £ r of an extension K* of K is said to be algebraically 
d&pendent over K if there exists a non-zero polynomial f(x v ...,x r ) 
in K[x v ...,x r ] such that 

/(&,.... &) - 0 . 

If not dependent the elements are algebraically independent over K. 

In the case r = 1, £ x is algebraically dependent over K if and 
only if it is algebraic over K. A proof similar to that of § 2, Th. I, 
enables us to show that if £ 1( ..., £ r are algebraically dependent over 
K there exists an irreducible polynomial f(x v ...,x r ) such that 

o. 

If £ r are algebraically independent, £ r are independent 

indeterminates over K, and therefore K(£ v ..., £ r ) is the field of 
rational functions of the indeterminates £ r . 

We have already remarked that there exists in K(£ v ..., £ n ) a set 
of indeterminates, which we may take as ...,£ r , which are 
algebraically independent over K, and which are such that any 
element of K(E, V ..., £ re ) is algebraic over ...,£ r ). More gener- 
ally, there may exist in any extension K* of A a finite set of elements 
i v . . ., £ r which are algebraically independent over K, and such that 
any element rj of K* is algebraic over if(£ x , ...,£ r ). When this is 
the case we say that K* has & finite algebraic basis with respect to 
K, and since ...,£ r are algebraically independent over K, these 
elements are said to form a minimal algebraic basis. 

Certain properties of extensions K* of K with finite algebraic 
bases are similar to properties of linear sets of finite dimension 
[II, § 1]. We now prove the generalised form of the Exchange 
Theorem. 

Theorem V. If (7] v ,...,-rj s ) is any algebraic basis for an extension 
K* of K, and £ lt . . . , £< are t algebraically independent elements of 
K*, then t^s, and for a suitable rearrangement of rj lt the set 
(Si, •••>£<, Vt+i> ■■■> Vs) ^ a basis for K*. 

We consider the case t = 1 first of all. Since is in K*, it is 
algebraically dependent over K(ij ly ...,y s ). Therefore there exists 
a non-zero polynomial/^, . . . , z s+1 ) such that 

fiVi Vs, Ci) = 0. 

If s = 0, this implies that is algebraic over K, which is contrary 
to hypothesis. Hence O 1. 
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Again, and for the same reason, one at least of the indeterminates 

x t x a must be present in f(x v x a+1 ). We may suppose that the 

ij { are rearranged so that x 1 is present. Then it follows that rj l is 
algebraically dependent over K(£ v rj 2 , ..., i/ 8 ). By hypothesis, any 
element £ of K* is algebraically dependent over K(ti lf ij t ,...,ii a ). 
But we now know that ij 1 , rj 2 , . . . , rj a are algebraic over K (£ x , i/ 2 , i/ g ). 
Hence, by Th. IV, we deduce that £ is algebraically dependent over 
K(E, v rj 2 , ...,71a). This proves the theorem when t = 1. We now 
proceed by induction on t. 

It is clear from the definition of algebraic dependence that if 

£ 1} ...,£, are algebraically independent over K, any subset is also 

algebraically independent. In particular, £ x , . . . , are independent. 

Hence, if we assume that our theorem is true for t— 1 elements £ j; 

we have the relation J , „ 

t- 1 

and we may suppose that the ij v ..., ij a are arranged so that the set 

(£l> • • • > 1st— 1> Vt> •••>Vs) 

is an algebraic basis for K* over K. Since is in K*, it is algebraic 
over K(£ v ...,S<-i»%» •••,’/«)• That is, there exists a non-zero poly- 
nomial f(x v ..., x s+ i) such that 

-..%»&) = °- 

Iff — 1 = s, it would follow that £, v ...,£< are algebraically dependent, 
contrary to hypothesis. Hence 

t — 1 < s, that is t < s. 

Again, one at least of the indeterminates x t ,...,x s must be present 
in f(x v a; 8+1 ), otherwise £ 1# . . . , would be algebraically dependent 
over K. We may suppose that the i/ i are rearranged so that x t is 
present. Then it follows that rj ( is algebraically dependent over 

A(£i, ••• > 1 > Vt+v Va)" 

By hypothesis, any element E, of K* is algebraically dependent over 
K(ti, • • • » Vt> • • • > Vs)- But we now know that Vo--, Vs 

are algebraic over K{£ lt ...,£i,v l+1 , Hence, by Th. IV, we 

deduce that £ is algebraically dependent over ..., rj t+1 , . . . , v s ). 

Hence (£i, •••>Vs) an algebraic basis for K*, and the 

theorem is proved. 

We deduce from the first part of this theorem that any set of 
t elements, where t>s, must be algebraically dependent. Now 
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suppose that the set (r/ ly ...,v} a ) is a minimal basis for K*, and let 
(Ci, ...,C*') be another minimal basis. Then 

s' < s and s 5% a’, 

and therefore s — s'. 


Therefore the number of elements in a minimal algebraic basis for 
K* with respect to K is independent of the minimal basis chosen. 
We call this number the dimension of K* with respect to K. 

Theorem VI. If £ lt ...,£ r is a minimal algebraic basis for an 
extension K* of K, and C is any element of K*, there exists a non-zero 
irreducible polynomial f(x lt ...,x r+1 ) in K[x lt ...,x r+1 ] such that 

/(£l,...,£r,C) = 0. 

If g(x v x r+ f) is any other polynomial such that 

0(£i. •••>&»£) = °. 


then g(x ly x r+i ) = f(x v ...,x r+1 ) h(x v x r+1 ), 

where h(x lt ..., x r+ f) is in K[x v ..., a; r+1 ]. 

Since C is algebraic over K(E , V £ r ) there exists a non-zero 
irreducible polynomial f(x l} ...,x r+1 ) in K[x x , ...,x r+1 ] such that 

Ml,— , = 0. 

Now let g(x v ...,x r+ f) be any polynomial such that 

g(ii,-,Zr, O = o. 


Since f(x v ...,x r+1 ), regarded as a polynomial in K(x v ...,x r ) [a; r+1 ], 
is irreducible, g(x v ...,x r+1 ), likewise regarded as a polynomial in 
K(x v ...,x r ) [» r+1 ], either contains f(x v ...,x r+1 ) as a factor, or the 
two polynomials are relatively prime. 

Case (i). Suppose that 


g( x i> -,*r+ 1) = /(* i» ->*r+ 1) • 

Then k(x lf ...,x r )g(xi, • ••,2'r+i) = f( x i> • ••>*>+ i)h( x i, 


Since K[x ly ...,x r+ f\ is a unique factorisation domain, and 
f(x 1} ...,a; r+1 ) is irreducible, and contains terms in x r+1 , the poly- 
nomial k(x v ...,x r ) must be a factor of h(x v ...,x r+1 ). Hence, after 
division by k{x lf ...,x r ), 

g(x i, . . . , x r+ i) = f(x i, . . . , x r+ f) h (x lf . . . , x r+ f). 
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Case (ii). If f(x v x r+1 ) and g(x v ...,x r+1 ) are relatively prime 
as polynomials in K(x v ...,x r ) [a; r+1 ], there exist elements 
a(x lt x r+1 ) and b(x v ...,a; r+1 ) of K(x v ...,x r ) [z r+1 ] such that 

a(x v ...,x r+1 )f{x v ...,x r+1 ) +b(x lf ...,x r+1 )g{x lt ...,x r+1 ) = 1. 


Let 

a(x v . 

••> X r+ 1) ~ 

A 1 (x v ..., x r ) ’ 

and 

b(x v . 


B(X i, ---yXr+x) 
B 1 (x 1 ,...,x r ) ’ 

where A(x v . 

..,x r+1 ) and B(x v .. 

.,x r+1 ) lie in K[x lt ...,x r+i \, and 


...,x r ) and B 1 (x 1 , ...,x r ) lie in K[x v ...,x r ]. Then 

• ■•,X r )A(X i, •••> a V+l)/( a 'l> •••> a V+l) 

+ A t (x v ...,x r ) B(x v .... x r+1 ) g(x v x r+1 ) 

= Ajfa,..., x r ) Bjfa,..., x r ), 

and therefore 

tr, Of -;0,0 

+ A Sv ■ ■ ■ , ir) Bax, • • • , in 0 g{ix, -,Zr,0 

= 0 . 

Hence 0 ,...,0 are algebraically dependent over K, contrary to 
hypothesis. The only possible case is therefore the first, and the 
theorem is therefore established. 

4. Factorisation of polynomials. Let f(x) be any irreducible 
polynomial in K\x\, and let us use it, as in § 1, to construct an alge- 
braic extension K0) of K. Now consider f(x) as a polynomial in 
Ka) IX] • By the Remainder Theorem [I, § 6], 

f(x) = {x-OfxW, 

since /(g) = 0, f-fx) lying in JT(g) [*]. Thus, when we extend K to 
Ka), f{^) becomes reducible. More generally, let F(x) be any 
polynomial of degree n in K\x\, and suppose that 

F(x) = F t (x)F 2 {x) ...F k (x), 

where the factor F t (x) is irreducible and of degree n* > 0. Then 

k 

2 % = n. 

x 



112 


m. ALGEBRAIC DEPENDENCE 


If, for some i, n i > 1, we can construct an algebraic extension of K 
in which Ffx) is reducible; that is, we can find an algebraic extension 
K' of K over which F(x) has at least k + 1 factors. Proceeding in 
this way, we reach a field K*, which is an algebraic extension of K, 
over which F(x) is equal to the product of n factors, each of the first 
degree. When a polynomial with coefficients in a given field can be 
expressed as the product of linear factors, it is said to be completely 
reducible over the field. Hence we have 

Theorem I. Given any polynomial f(x) in K[x ], there exists an 
algebraic extension of K over which f(x) is completely reducible. 

This theorem takes the place in our present work of the ‘ Funda- 
mental Theorem of Algebra ’, which asserts that when K is the field 
of complex numbers every polynomial in K[ x] is completely re- 
ducible over K. 

A field K which is such that every polynomial in K[x] is com- 
pletely reducible over K is said to be algebraically closed. The field 
of complex numbers is one example of such a field. In later chapters 
we shall often begin by considering a field K which is algebraically 
closed. But we cannot restrict ourselves to such fields, for the 
adjunction of an indeterminate to K gives a field which is not 
algebraically closed. For instance, if K is the field of complex 
numbers, and x, y are indeterminates, x^ + y 2 — 1, regarded as a 
polynomial in y, is irreducible over K(x). We shall therefore 
investigate fields which are not algebraically closed, merely stating 
the important 

Theorem II. Given any field K there exists an algebraic extension 
K* of K which is algebraically closed. K* is uniquely defined by K to 
within equivalent extensions. 

This field K* is called the algebraic closure of K. We shall not use 
this theorem, and therefore omit the proof, which is difficult. 

Let us return to the commutative field K in which the polynomial 

F(x) = F 1 (x)...F k (x) 

is of degree n, the F t (x) being irreducible polynomials in K\x\. Let 
K* and K* be any extensions of K over each of which F(x) is 
completely reducible. If 

F(x) = a n (* - £i), and F(x) = a n (» - y { ) 

i - 1 i - 1 

are the respective factorisations of F(x) in Kf, K$, we examine the 
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relation between the extensions K(£ v and K(r/ V 

These fields are subfields of Kf and K* respectively. Since we shall 
not go outside these subfields, we may suppose that 

= and Kt = K( Vv ..., Vn ). 

Then iff and if* are both algebraic extensions of K [§ 3, Th. II]. 
We prove that they are equivalent extensions of if. 

The proof is by induction on n. If n = 1 there is nothing to prove, 
since iff — K — K*. We therefore assume that the theorem is true 
for polynomials F(x) of degree less than n. If the polynomial we 
are considering has a linear factor x — a in if [a;], 

= m a,. 

and the theorem follows by the induction hypothesis. We now assume 
that no irreducible factor F t (x) of F{x) is of degree 1, so that none of 
...,£„ and tj v is in K. Since iff is a field, if* [a;] is a unique 
factorisation domain, and hence the factors of F t (x) are certain of 
the factors a: — £*. By renaming the factors, if necessary, we may 
assume that £ — is a factor of F x {x), so that F^j) = 0. Similarly, 
we may suppose that F^rjj) = 0. By § 2, Th. II, the fields if(£ x ) and 
K(r) 1 ) are equivalent. It follows that the rings if(£j) [a;] and K{rj^) [x] 
are equivalent, so that if 

F(x) = Ff>(x ) ... Fp(x) 

is the factorisation of F(x) in K{i^)[x], the polynomials Ff l) (x) 
being irreducible polynomials with coefficients in then also 

F(x) = F[^(x ) ... F% l) (x) 

is an equivalent factorisation of F(x) in [x], the polynomial 
F^ l) (x) being the isomorph of Ff'\x). But F(x) has now a hnear 
factor x — in K(£,^) [x], or x — r) l in K(?} 1 ) [a:], so that, by the in- 
duction hypothesis, we may assume that the extension obtained by 
adjoining | 2 , . . . , | w to If (£ x ) is isomorphic with the extension obtained 
by adjoining i) z , ...,i f n to K^^), the isomorphism being such that 
any element of K (£ x ) is the isomorph of the corresponding element 
of the equivalent field K(i) i). Hence, K(£ v ...,| n ) and K(7j v 
are equivalent extensions of K. 

If F(x) is any polynomial of degree n in K[x~\, and, if over some 
extension of K n 

F(x) = »n(*-u 

<=i 


HP I 


8 
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the field K(£ v ..., £ n ) is called a root field of F(x). We have therefore 
proved 

Theorem III. The root field of a 'polynomial F(x) in K[x] is 
uniquely defined to within equivalence over K. 

We may use these theorems to illustrate an important observa- 
tion, namely that, given two extensions K x and K 2 of a field K, 
there may be no extension K* of K which contains K x and K 2 as 
subfields. To see this, let F(x) be an irreducible polynomial over 
K, of degree n(n> 1), and take K t and K 2 to be equivalent, but 
not identical, root fields of F(x) such that K 1 and K 2 intersect 
only in K. Suppose that £ 1; ...,£ ra are the roots of F(x) in K lt and 
that rjv ...,rj n are its roots in K 2 . Then £ f 4?^ for any choice of 
i, j. If K* were a field containing both K x and K 2 , we should 
have, over K*, n 

F(x) = a n 

i — 1 

and F(7] j ) = 0. 

Hence II (Vj -&) = <>, 

i «=1 


and therefore i) f = £ fc for some choice of k, contrary to hypothesis. 

If F(x) is any non-zero polynomial in K[ x], any element £ in K, 
or in an extension of K, such that F{£) — 0, is called a root or zero 
of F(x). We now prove 

Theorem IV. A necessary and sufficient condition that two poly- 
nomials in K[x~] should have a common root is that their highest common 
factor should be of degree greater than zero. 

Let F(x) and 0(x) be the polynomials, d(x) their highest common 
factor. Then there exist polynomials a(x), b(x),f(x), g(x), all in K[x], 

such that a(») F(x) + b(x) 0(x) = d(x), (1) 

and F(x) = d(x)f(x), 0(x) = d(x) g(x). (2) 


Let K* be an extension of K such that F(x) and 0(x) are completely 
reducible in K*[ x]. Then d(x) is also completely reducible. If £ is 
a common root of F(x) and 0(x) then, from (1), £ is a root of d(x), 
and if £ is a root of d(x) we see, from (2), that £ is a root both of F(x) 
and 0(x). This proves the theorem. 

We note that the highest common factor of F(x) and 0(x) can be 
found by operations in K [x] [I, § 6], but the common root of F(x) 
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and 0(x), if there is one, may lie in some algebraic extension of K, 
defined by d(x). 

The theory of root fields is a large one, but most of it lies outside 
the scope of this book. We conclude this section with an account of 
the symmetric functions of the roots of a polynomial. 

Let x v ...,x n be n independent indeterminates over K, and let 

\ 

c 1 = x 1 +...+x n , 

= "b *®i *^3 ”b • • • "b ®n— 1 '*'n> 


C n — Xi ... X n . 

We call c 1 ,c 2 , ...,c n the elementary symmetric functions of x x> 

They are evidently unchanged by the n\ permutations of the in- 
determinates x v ...,x n . Any polynomial in K[x v ...,*„] is said to 
be a symmetric integral function of x v ...,x n if it is unaltered by the 
n\ permutations of x v ...,x n . We prove that such a polynomial is 
expressible in terms of the elementary symmetric functions, or, 
more precisely, we prove 

Theorem V. If f(x v ...,x n ) is a symmetric integral function of 
*i, •••»£„, it lies in K[c v ...,cj. 

n 

We define the total degree of an expression asp a# ... x% to be 2 r i> 

i = 1 

and the total degree of /( x v x n ) to be the maximum of the total 

degrees of its terms x^x^' ...x r ” for which the coefficient a Ti ra ... r „ 4= 0. 
We prove the theorem by induction, first on n, then on the total 
degree of f(x 1} ...,x n ). 

In the case n — 1, c x = x v and hence K[xf[ — K[cf[, and the 
theorem is trivial. On the other hand, if f(x v •••,x n ) is of total 

degree 1, f(x v ...,x n ) = a t x t + ...+a n x n +b. 

Interchange x { and x } . Since f(x v ...,x n ) is symmetric, we must 
have a i = ay We deduce that 

f{x v ...,x n ) = a(x 1 +...+x n ) + b 

— CtCj + 6* 

It therefore lies in K\c x , ...,c n ]. Having established our theorem 
when n — 1 , we assume that it is true for symmetric integral func- 
tions of x v ...,x n _ v It is true for symmetric integral functions of 
x v ...,x n of total degree 1 ; and so we assume it is true when the total 

8-3 
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degree is k—\, and prove it is true for the symmetric polynomial 
f(x v . . . , x n ) of total degree k. 

Write f( x V • • • , x n) = fo +fl X n + • • • +fk x n> 

where / 0 , . . . ,f k are in K[x v x ri _ 1 ] . If we consider the permutations 

of x 1 ,...,x n which leave x n unchanged, f(x v ...,x n ) is unchanged, 
and therefore / fc are unchanged by the permutations of 
x v ..., x n _ v It follows that/ 0 , are symmetric integral functions 

of x v ...,x n _ v By the induction hypothesis each of these poly- 
nomials, and in particular/,,, is expressible in terms of the elementary 
symmetric functions of x v . . x n _ v Writing 

d 0 = 1, 

d x = x i + . . . + x n _ v 

d ‘2 — “b X 1 X Z "b • • • ”b x n — 2 x n— i> 


d n -i — x \ x 2 ■■■ X n -\1 

< = o, 

we have /o = ^o(^i> 

and we note that 

= di + x n di_ x ( i = 1, (3) 

Now f(x i, = /„ + x n F{x v . . . , x n ), 

and, by (3), 

0o( c l> • • • > c »-l) = 0o( d l> • • • » d n - 1 ) + x n^ > o( x l> x n)‘ 

Therefore 

f( x V •••»«„)- <j>o(Cv • • • . c n-l) = •••>*„)- ^o(*l» ■ ■ • . *n)]- ( 4 ) 

The total degree of 0„( c i> •••> c n-i)> regarded as polynomial in 
x v -.-,x n , is equal to that of $ 0 (d v d n _ 1 ), which is that of/ 0 . Hence, 
since this cannot exceed k, the total degree of the right-hand side 
of (4) cannot exceed k. Since the left-hand side of (4) is a symmetric 
integral function of x v ...,x n , the right-hand side must contain, 
besides x n , the factor x 1 x 2 ... x n _ v We can therefore write 

f( x l> • • • > x n) ~ 0o( c l> •••> c n-l) = *1 ••• X n @( X 1> ••• > X n ) 

= c n 0(x v ...,X n ), 



4. FACTORISATION OF POLYNOMIALS 117 

where 0(x v ...,x n ) must be symmetric in x v ...,x n , and is of total 
degree < k—n . By hypothesis, we can write 

0(x v ...,x n ) = rjr(c v ...,c n ), 

and finally 

f{x v ...,x n )= <f> 0 (c v . . . , c n _j) + c n f{c v ...,c n ), 

and thus f(x lt ... ,x n ) lies in K[c v ...,c n ]. This proves the theorem. 

This theorem is mainly used to connect the coefficients of a 
polynomial with the roots of the polynomial. We adjoin c v ...,c n 
to K, obtaining a field K* which is a subfield of K(x v ...,x n ). We 
consider the polynomial in 

f(x) = x n — c x x n ~ l + c 2 a; n_2 —... + ( — 1 ) n c n . 

In an extension of K*[ x] this equation can be written as 

f(x) = (x-zjix-zj... (x-x n ), 

and Th. V can be regarded as a property of the root field of f(x). 
Again, consider the polynomial 

F(x) = x n — v 1 x n ~ 1 + l) n v n 

in K\x], and let ...,£ n be its roots, in some extension K* of K. 
Any element of the root field K{E, lt ...,£„) can be written as a 
polynomial in with coefficients in K [§ 3, Th. II]. Let 

a(£ 1# denote such a polynomial. It is said to be a symmetric 

function of the roots | 1( ...,£ n if a(x v ...,x n ) is a symmetric integral 
function of the indeterminates x v . . ., x n . By Th. V, 

a{%i, b{c j, ...,c n ), 

where b(c v ...,c n ) is a polynomial with coefficients in K. Hence, 
by specialising x { ^ t - (i = 1, we find that 

®(£l> •••> = b(V\, ..., 

and we have proved 

Theorem VI. If F(x) is any polynomial in K[x~\, any symmetric 
function of the roots of F(x) lies in K. 

On the other hand, it may happen that a( | 1 , ...,^ n ) is in K, 
although a(x x x n ) is not symmetric. 

Theorem VII. If y is any element in the root field of F(x), the 
characteristic polynomial of y over K is completely reducible in the 
root field. 
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As above, let £ v ..., £ n be the roots of F(x), and let us write 

V = 

where a(x v ...,x n ) lies in K[x v ...,x n ]. Now let 
a *( x v •••> x n) (*» 1,. ..,»0 

denote the result of performing a permutation on the indeterminates 
x v ...,x n in a(x v ...,x n ). Consider the polynomial 

nl , 

II [ x-a t(x v ...,a; n )] = x nl + b 1 x n '~ 1 + ... +b nl . 

i = 1 

The coefficients b { are clearly symmetric integral functions of 
x v . . . , x n , and therefore if 

i=i 

4>(x) lies in K\x\. But <j>(r}) = 0, and therefore the characteristic 
polynomial of y is a factor of <p{x). Since 4>{x) is completely reducible 
in the root field, so is the characteristic polynomial. 

We conclude this section with 

Theorem VIII. The elementary symmetric functions c lt ...,c n of 
n independent indeterminates over K are themselves algebraically 
independent over K. 

Let the indeterminates be x v If n = 1, c x = x lt and the 

result is evident. We prove the theorem by induction on n, using 
the notation of Th. V. By hypothesis, then, d lt ...,d n _ 1 are 
algebraically independent over K . Now, if c v . . . , c n are algebraically 
dependent, there is a relation 

0(Cl, = 0o + 0i c n+ ••• + & c n = 0, 

where the he in K[c v ...,c w _J, and are not all zero. Putting 
x n = 0 in this identity, we find that 

(j)(d v ...,d n _j, 0) = f>o(Ci, c„_ 1 ) Xn _o = 0o(^i» i) 

- 0. 

By hypothesis, <f> 0 (d i, . . . , d n _f) = 0 imphes that <f 0 is identically zero. 
Hence, if <f(c v ...,c n ) = 0, we have 

...,c n ) = c n ^(Ci, ...,c n ) = 0, 

and therefore ft(c v . . . , c n ) = + . . . + c l ~ 1 = 0. 

The theorem now fohows. For we prove, in succession, that 
0 i, 02 > are ah identically zero. 
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5. Differentiation of polynomials. Let x be an indeterminate 
over K, and let 

f(x) = a 0 + <x 1 x + oc 2 x i + ... + a n x n 
be any polynomial in K[x\. Then, if h is any new indeterminate, 


f(x + h) -f(x) = a 0 - a 0 + a x [(x + h) - x] + . . . + a. n [(x + h) n - x n ] 

= h^(X, x + CC 2 [x + h + + . . . + "b h) n ~ 1 

+ x(pc + h ) n ~ 2 + . . . + a:”- 1 ]}, 


and therefore 


f(x + h)-f(x) 


= /'(*) + hg(x,h), 


where g(x, h) is a polynomial in K[x, h], and 

f(x) = a 1 + 2a i x + Sa s x 2 + ...+na n x n ~ 1 . 

The polynomial/'(a;) is called the derivative of f(x) with respect to x, 
d df 

and is often written ^/(x) or 

Let us consider in what circumstances f(x) = 0. We must have 
a x = 0, 2a a = 0, na n — 0. 


If K is without characteristic, this implies that 

a 1 = a 2 = ... = a n = 0 , 


and therefore f(x) = a 0 , and so lies in K. Conversely, if f(x) is in K 
we find that /'(a;) = 0. 

If K has characteristic p, the equation f'(x) = 0 implies that the 
coefficient = 0 only when i is not a multiple of p. In this case, then, 
f(x) lies in K[x v '\. Conversely, if f(x) is in K[x p ], the derivative is 
zero. This difference between fields with and without characteristic 


has many important consequences, and will shortly lead us to 
impose the restriction that K be without characteristic. 

Let us first note some formal consequences of the law of differ- 


entiation. If 


9( x ) ■= fio + fi\X+ ...+ fi m x m . 


we see at once that 


[ft x ) + 9( x )Y = (<*! + &) + 2(a 2 +0 2 )x+... 

=f'( x )+g'( x ), 
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£/(*)?(*)]' 

- “ [ a oA) ( a l A) “I" a 0 A.) #+••• + (&rfio + a r- 1 A. + • • • + a o A) *^ r 

+ -+“X* M+n ]' 

= K Ao + <*o A) + • • • + r(ot r /3 0 + . . . + ct 0 fi r ) x'- 1 

+ ... + (m + n) a, n /3 m x m+n - 1 

= (a x + 2a 2 x + . . . + wa^x"- 1 ) (/?„ + A* + . . . + fi m x m ) 

+ (a 0 + a, x x + . . . + oc n x n ) (A + 2/? a x + . . . + m^ m x m ~ 1 ) 

= f'(x)9(x) +f{x)g'{x). 

As an immediate corollary, we have, for any positive integer r, 
{[f(x)ry = rf'(x)[f(x)y-\ 

We extend the definition of a derivative to rational functions of 
x as follows. 

if 

»<*) 

so that yg{z)=f(x), 

we define y' , formally, by the equation 

/'(*) = y'g(x)+yg'(*), 

and therefore we have the equation 

, /» f(x) g'{x) f'(x) g(x) -f(x) g'{x) 

y g(x) [g{x)f [<?(*)] 2 


In the particular case in which g(x) is a factor of f(x), y is a poly- 
nomial h(x), and we can verify that y' = h'(x). 

The operation of differentiation may be repeated any finite 
number of times. If / (8) (x) denotes the sth derivative of f(x), the 
following results are immediate: 

(i) [/W(x)F=/ ( ^)(x); 

(ii) [x n ] (r) = n(n — 1 )...(» — r + 1 ) x n ~ r (r^n); 

(iff) / (r) (x) = 0 

if/(x) is a polynomial of degree less than r; 

(iv) if K is of characteristic p, and r^p, 

fto(x) = 0. 

This follows from (ii), since, if r^p, one at least of the integers 
n,n—l, ...,n — r + 1 is a multiple of p. 
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(v) if K* is an extension of K, and/(x) is in K[x\, the derivative 
of f(x) in K*\x\ is equal to its derivative in K\x\. 

If f(x v ...,x n ) is a polynomial in K\x lt a; n ], we may consider it 
as a polynomial in K[x v ...,x r _ v x r+1 , x n ] \x r ], and so define its 
derivative with respect to x r . We call this the partial derivative with 
respect to x r , and write it 

a 


dx. 


f(x v ...,x r ). 


When r — \, 
Now 


y = # 

dx t dx± ' 


(ax? 1 ...x? r ...x? H ) = m r ax? 1 ... x^lj'x^'^x?^? ...x%", 


dx r 

and therefore, if r 4=s, 

a 2 


dx s dx r 


(ax? 1 . . . x: *•) = (ax? 1 . . . x?*) 


= m T m s ax? 1 ...x?i-?x? i, ~ x ...x?±i i x? , ~ 1 ...x?* 

and this relation is obviously true when r — s. Similarly, we show 
that 

and we can write either of these expressions as 

a® 


dx r dx s cx i 


[ax? 1 ...X?*]. 


Since the polynomial's:!, ...,x n ) is a sum of terms like ax? 1 . . . x?", 
we obtain the commutative and associative laws 

3 ^/ (x ‘ x " ) = 3 ^ /(a: > x ">' 

k(S^, Axi ' = *->) 


dx r dx s 

a 8 

dx r dx s dx, 


f(x v ...,x n ). 


and 
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Our next result is valid without restriction only when the field 
K is without characteristic. As a preliminary we note that the usual 
elementary proof of the multinomial theorem 


(* 1 +...+x n ) r = r- 


r! 




. X 


>Tn 
n > 


where the summation is over all ordered partitions (r v ...,r n ) of r, 
is valid over any field K without characteristic. 

Now let 


^yxffa 1 > ^ 


rl 


ri !.../•„! 




, drf(x v .",X n ) 
yn dx{' . . . dx% 


We can write this symbolically as 

( 0 g \r 

yi dx 1 + '” + yn d^J ^* 1 * —»**)• 

We prove 

Taylor’s Theorem. // f(x 1 ,...,x n ) is any polynomial in 
K[x v of total degree N, then 

N ! 

/(SJl + T/l, ...,* n + 2/ n ) = 2 

r= O' • 


Let us first establish this theorem when n = 1. If 


f(x) = a 0 + a 1 x+ ...+a N x N , 
then f(x + y) = a 0 + a 1 (x + y) + ... + ot N (x + y) N 
= a 0 + a x x + . . . + a N x N 

+ 2/t a i + 2a 2 a;+ ... + ^a Jv x Ar_l ] 

+ |j [2.1a 2 + 3.2a 8 a;+...+iV(iV-l)a Jv a; iV - 2 ] 

+ ... 


-/W+^^+ 


3/(*) . 1 ..»8y(g) 


9* '2!^ 0« 2 


■ + .. 


+ ± v n?m 

+ Nl y dx N 
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We now proceed by the method of induction, assuming that the 
result is true for any polynomial in less than n indeterminates. If 

+ - +yn ~ i di^j ’ 

we verify by direct calculation that 

( 3 3 3 \ r 

y 'dx 1 + "‘ +yn - 1 d /to’ •••»*») 

= S o Q y S n TO'/(*1. ...»*•)]. 

Now, by the induction hypothesis, 

N 1 

f( x l + Vv • • •» *n-l + 2/n-l> *n + 2/n) = S 71 D lxf( x v x n-l> *» + 2/J, 

s-0 5 ! 

and, applying Taylor’s Theorem for the case n — 1, we have 

ns i y. 

D s y X f(x 1, . . . , x n _ v x n + y n ) =^j\y l n^ D s yx f{x 1, ...,x n ). 
Therefore 

N l Ns 1 0/ 

ffel + Vv •• •> X n~* r Vn) “ X Tj 2 Dy x f(%V 

8=0^1 t = 0 * i 

The summations on the right are over a lattice of points on the 
boundary of and within an isosceles right-angled triangle of side N. 
We may exhaust all the points of the lattice by considering those 
on lines parallel to the hypotenuse. In other words, put 

s + t = r (O^t^r), 

and let r vary from 0 to N. Then 

f(x 1 + y l , ...,x n + y n ) =^Z h y l n g^J D lx l f( x v • • • » x n) 

N 1 

== 2 ~i ^yxf( x lt •••y x n)> 
r = 0 ' 1 

by the result stated above. This proves the theorem. 

We now consider certain applications of the process of differentia- 
tion to the theory of the roots of a polynomial. 
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Theorem I. Iff(x) is any polynomial in K[x] of degree n greater 
than zero, a necessary and sufficient condition that (in an extension K* 
of K in which it is completely reducible) f(x) should have less than n 
distinct roots is that the highest common factor of f(x) and f'(x) be of 
degree greater than zero, or else thatf(x) = 0. 

The latter case only arises in the case of a field of finite 
characteristic. 

(i) The condition is necessary. Suppose that f(x) is completely 
reducible in K*\x\ and that 


/(*) = a(x-£ 1 ) m i...(x - &.)»•»■ (m t > 0 ), 
where £j £ r are distinct. Then 

f(x) = ^) m i ... (x- i - 1 ...(«- £ f )”V. 

i-1 

If m v ...,m a > 1, m s+1 , ...,m r = 1, then 

f(x) = 0 , 

or else (x — ... (x — £ g ) m » _1 

is a common factor in K*\x\ of f(x) and f(x). Since f(x) and f'(x) 
are not relatively prime in K*[ x], they cannot be relatively prime 
inZ[a:] [I,§6,Th. II]. 

(ii) The condition is sufficient. Consider, first, the case in which 
f(x) is zero. Then, as we saw above, p. 119, K must be of finite 
characteristic, say p, and f(x) is in K[x p ], Let 

f(x) = <f>( x p ), 

where <f>(y) is in K[y], There exists an extension K’ of K such that 
<p(y) is completely reducible in K’[y]: 

r 

<f>(V) = aU(y + Vi) ( rp = n ). 

i= l 

There also exists an extension K* of K’ containing elements 
£i,...,£r such that 

8 = Vi (i = l,...,r). 

Now, since K* is of characteristic p, a simple application of the 
binomial theorem shows that 

(x + b)* 1 = x» + £? = x» + ri i . 
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Hence in K*[x), r 

f(x) = a]J(x p + ?i i ) 

t-1 

= aj_n(«+i-)j P . 

Hence f(x) is completely reducible in K*[x] and has only r distinct 
roots, £ l5 ...,£ r . 

Next, suppose that f'(x) is not zero, and that f(x) and f'(x) have a 
common factor g(x) of degree greater than zero. Since f'(x) is of 
degree n — 1, g(x) is of degree m<n. We have 

f{x) = g(x)h{ x), 

and hence f'(x) = g(x)h'(x) + g'(x)h(x). 

Since g(x) is a factor of f'{x) it is a factor of g'(x)h(x), or else this 
product is zero. Since h(x) is not zero, the product can only vanish 
if g'(x) is zero. The proof given above shows that g(x) is then the pth 
power of a polynomial in some extension K'[x\ of K[x], and hence 
in this extension f(x) has a multiple factor. On the other hand, if 
g'(x) is not zero and g(x) is a factor of g'{x) h(x), g(x) and h(x) must 
have a common factor, since g'(x) is of lower degree than g(x). 
Hence once again f(x) contains a multiple factor. Hence in both 
cases we have, for a suitable extension K', 

/(*) = FI IM*)]” 1 *, 

i 

where some m { > 1. Making a further extension K* of K' so that 
each ffx) is completely reducible in K*[x], we obtain the required 
theorem. 

We conclude this section with 

Theorem II. If K is without characteristic, the roots of an irre- 
ducible polynomial f(x) are all distinct. 

In the root field we may write 

f{x) =aX\{x-i i ), 

i-l 

and we must prove that £ x , ..., are all distinct. If they are not, 
the previous theorem tells us that f(x) and f'(x) have a common 
factor. Since f'(x) is of lower degree than/(a;), and f(x) is irreducible, 
there is no cotaamon factor. K being without characteristic, we have 
excluded the possibility of f(x) being identically zero. 
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6. Primitive elements of algebraic extensions. In § 3, Th. I, 
we saw that any extension of K of finite degree was an algebraic 
extension of K. On the other hand, we saw in § 1 that a simple 
algebraic extension of K is an extension of finite degree. We now 
strengthen this relationship between extensions of finite degree and 
simple algebraic extensions by proving 

Theorem I. If K is a commutative field without characteristic, 
any extension K* of K which is of finite degree is a simple algebraic 
extension. 

Since if* is an extension of finite degree we can write 

K* = K(T v ...,T n ), 

where r 1( ...,r n are algebraic over K. We prove the theorem by 
induction on n. When n = 1 , K(t x ) is the algebraic extension of K 
defined by the characteristic polynomial of r x [§2, Th. I]. We 
therefore assume that the theorem is true for K(t v ...,7^). By 
hypothesis, there exists an element £ of K(t v ...,r n _ 1 ) such that 

K(Q = K(Ti, ..., T n _ 1 ). 

We must show that K(£, r n ) is a simple algebraic extension of K. 
Thus we need only consider the case n = 2; that is, we need only 
prove the theorem for the field K(£,y), and the theorem will then 
be proved by induction. 

Let f(x) and g(x) be the characteristic polynomials of £ and y with 
respect to K, and suppose that in the extension K' of K 


f(x) = (z-£i)... (*-£»), g{x) = (x-yfj ... (x~y m ), 

where we may suppose that £ = and y = y v The result we are 
trying to prove is trivial if either n or m is equal to 1 , so we assume 
that n > 1 , and m > 1 . 

Since K is without characteristic and f(x), g(x) are both irre- 
ducible, the roots £ lf . . . , are distinct, and so are the roots y v -..,y m 

[§5, Th. II]. If fc#= 1, then and therefore the equation 

ii + xy k = ^ + xy (1) 

has, for all values of i, a unique solution x = c ik in K' . The field K 
is without characteristic, and therefore contains an infinite number 
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of distinct elements. We can therefore find in K an element c 
different from any of the solutions 

c fk {i = l,...,n;k = 2,...,m) 

of the equation (1) which happen to be in K. For this value of c, 

£i+ c Vk * i+cy (* = i, •••»»; k = 2, 

Let 6 be the element in K{ £, ij) defined by 

6 = i+C7j. 

Since 6 is in K(£, tj), K(6) lies in K(£, tj). We prove, conversely, that 
K(£, rj) lies in K{6), and therefore that the two fields are the same. 
We know that . . . 

g(y) = o, 

and f(£) =f(d-crj) = 0. 

Writing f(6 — cx) = h(x), 

the polynomial h(x) hes in K(d) [a;]. Also, the polynomials h(x) and 
g(x) have the common factor x — r/ in K'\x\. This is also the highest 
common factor, for if x — ij k (k > 1) were a common factor, 

HVk) =f(0-cVk) = 0, 

and for some value of i we should have 

and therefore 6 — + crj k , 

contrary to the definition of c. The polynomials g(x), h(x) of K(d) [a:] 
therefore have x — r/ as their highest common factor in K'[x]. 
Their highest common factor in K{6) [a;] is therefore 

ax+b = a(x — 7j) (a and b in K(6)). 

It follows that 7/ is in K{6). But 

I = e-c V , 

and therefore £ is also in K{6). Hence K(£,rj ) is in K{6), and the 
theorem follows. 

An element £ of an extension K* of K is called a primitive element 
of K if K* = K(£). Any extension of finite degree of a field without 
characteristic therefore contains primitive elements. 
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7. Differentiation of algebraic functions. In § 5 we defined the 

partial derivatives of a polynomial in K[x x , . . . , # n ] with respect to 

x x , ...,x n . Assuming that the ground field K is without characteristic 

we now show how to extend the definition so that we can define the 

partial derivatives of an algebraic function y oix x , ...,x n . We shall 

then obtain an important criterion for the algebraic independence 

of a given set of elements of a given function field. 

Let K* be an algebraic extension of K[x x , ...,aj n ], and let y be 

any element of it. As in § 3 ,7] is said to be an algebraic function of the 

indeterminates x x , x n . By § 3, Th. VI, there exists an irreducible 

polynomial/^, ...,x n ,y), unique save for a non-zero factor in K, 

such that . 

J\ x 1> x n> V) ~ u - 

Any polynomial g(x x , ...,x n ,y) such that 

g(x v ..., y) = o 

contains f(x x , ...,x n ,y) as a factor. Now, x x , ...,x n are algebraically 
independent over K. The polynomial f(x v ...,x n ,y) therefore 
contains y, and, since K is without characteristic, 


We write 


-f(x 1 ,...,x n ,y)*0. 


fyf(Xv;X n> 1)) 


for the result of replacing y by rj in ■■■> x n >y)- This latter 

polynomial cannot contain/^, ...,x n ,y) as a factor, and therefore 

g 

az/(*i, • ••,*», 7) + 0. 


With these preliminaries, we define the partial derivatives drj/dXi 
by means of the equations 

^f(x 1 ,...,x n ,y)^ + ^f(x 1 ,...,x n ,rj) = 0 (i = 1 

We note that if F(x v ...,x n ,y) is any polynomial such that 

F{x x , ...,x n ,7}) = 0, 

F(x v ...,x n ,y) = 0(x v ...,x n ,y)f(x v ...,x n ,y), 


then 
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and therefore 

d 3 n . , 

^F(x 1 ,...,x n ,y)^l + ^F(x 1 ,...,x n ,y) 

= f( x V 2 /) ^(* 1 . ■■■’ X n>y)^~ i + ^ { G ( x l> •••»*„. */) J 

+ <?(*!, • • • , x n , y) ^Jyf( x v • ••»*»» S') |^ + ^rf( X v ~-> x n> */)] • 

It follows that 

| f (*„ . . . , *„ v) ^ *■<*.. - o. 

In the particular case in which y lies in i^aq, . . £ n ),our definition 
of dyldxi coincides with that given in § 5. For if 

g(x v ...,x n ) 

V h(x v ...,x n y 

where both g(x v ...,x n ) and h(x 1 ,...,x n ) are polynomials without 
common factor, the characteristic polynomial of y is 

f(x v ...,x n ,y) = h(x 1 ,...,x n )y-g(x 1 ,...,x n ), 

and our present definition leads to 

• • •> x n) ^ 9( x i> ••.,*„)- 9( x v ■ ■ ■ > x n) h ( x v •••> * j] 

x[h{x v ...,x n )]-*. 

Since the dimension of K * is n, a minimal basis is formed by any 
n algebraically independent elements of K*. Let £ x , be n 
such elements. Then, by § 3, Th. VI, there exists a unique irreducible 
polynomial F(x v ...,x n ,y) such that 

F(£, v y) =* 0. 

As above, we show that 

•••> £n> v) + 

and define dy/d£ { by the equations 

|><& u, v) J| ■ + ^ m i„, v) - o. 

We wish to establish a relation between the partial derivatives 
dy/dxi and dy/d^, but we must first prove a lemma. 


HP I 


9 
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Let & .-.fir be r elements of an algebraic extension of a field K 
without characteristic, and let f t (x) be the characteristic polynomial 
of with respect to K. We prove that 

IfF(x u ...,x r ) is any polynomial in K\x lt x r ] stick that 

then there exist polynomials A(x v ... y x r ), B t (x lt ...,x r ) in K[x lt . . . , x r ], 
where A (f v . . . , £ f ) 4= 0, such that 

r 

A(x lt ...,X r ) F(x v ...,x r ) = S Biix^ •••,*,■) /i(*i)- 

i-1 

This result has been proved already when r = 1, being implicit in 
§ 2, Th. I, and we proceed by induction, assuming its truth when 
r = s— 1. In place of if, we consider the field K{£ s ), which is a simple 
algebraic extension of K. Over this new field the characteristic 
polynomials/^*) (i = 1, 1) of £ x , may be reducible. 

But since K(£ s ), with K, is without characteristic, only one irre- 
ducible factor of each characteristic polynomial f t (x) can become 
zero for the specialisation *->£ f . We write 

where ^(x^x) and rjr^x g , *) are polynomials in K[x s , *], <pi{i 8 , x) is 
irreducible in K(£ 8 ) [*], and 

UU.Zi)~ 0. 0 (<-1, 1). 

By hypothesis, there exist polynomials 

a(x i, . . . , x g _ lt £ s ) and &*(*!,..., x 8 _ u i B ) 

in K{Q [* 1( . . . , x a _j] such that a(£ x , . . . , g 9 _ x> Q * 0, and 

8-1 

a(x x , . . .,x 8 _ x , E, a ) F(x x , . ..,x s _ x> £ s ) = S bi(x v x s _ lt £ g ) ^(£ s , ** ). (1) 

i = 1 

Since K(£ s ) is a simple algebraic extension of K we may assume 
[§ 1 , ( 1 )] that a(x v ...,x s ) and 6 i (* 1 , . . . , x 8 ) are in K[x u ...,* g ]. Now 
multiply (1) by appropriate factors, so that each term &(£„,**) is 
replaced by f t (x { ). Then (1) becomes 

fll *i)l «(*i. • • •. * 8 - 1 . U F ( x i> • ••. *8-i. U 

1<-1 t 8-1 

= 2 B^x^ .... x g _ x , £ g ) (2) 

where < ” 1 

-®i(*i» • • •» *8—i» £«) ” ^i(* i. •••» *8—i» Ss) 0 f y(^8» *®y) 

(j = 1, — l,i + l, 1). 
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If we write 8 _i 

A(x ly ...,x 8 ) = a(x v . . ., x 8 ) n 'hi x s > *<)> 

i =*1 

we have , , „ _ . y _ v 8 ~ 1 , . ~ >. v , ^ 

i=l 

Now, the polynomial 


s — 1 


A{x v ...,x s )F(x v ..., x 8 _ lf xJ-XBiix x 8 ) /fa) 


i =1 


( 3 ) 


vanishes (by (2)) for the specialisation 


£«• 

This must be true for the coefficient of each power-product in 
x v ...,x 8 _ v Hence, the coefficient of each power-product must be 
of the form d(x 8 ) f 8 (x s ). Therefore the polynomial (3) must be 
equal to B s (x v ...,x 8 )f 8 (x 8 ), where B s (x v ...,x 8 ) lies in K[x v ...,x 8 ]. 
Therefore 

8 

A(x v ...,x 8 ) F(x v ...,x 8 ) = 'ZB i (x 1 ,...,x s )f i (x i ), 

i =i 

and our lemma is proved. 

Now let i) v ...,ri n be n elements of an algebraic extension K* of 
K(x v ...,x n ) which are algebraically independent over K, and let 
£ be any other element of K*. By §3, Th. VI, there exist unique 
irreducible polynomials f(x v ...,x n ,z), g(y v ...,y n ,z) such that 

f{x 1 ,...,x n) Q = 0 and g(y v ...,y n ,Q = 0, 


and also unique irreducible polynomials f t (x v ...,x n ,y f ) such that 

U x v -, x n>Vi) = 0 (* = 1 , .... 70 - 

In the lemma proved above, take the field K(x v . . . , x n ) in place of K, 
and the field K* in place of the extension field of K. Then we can 
find polynomials 

A{x,y,z) = A(x v ...,x n , y v ...,y n , z), 

B i (x,y,z) = B t (x v ...,x n , y v ...,y n , z), 

G(x,y,z) = C{x v ...,x n ,y v ...,y n , z) 

such that 

A(x,y,z)g(y u ...,y n ,z) 

n 

= S B t {x, y, z) ft(x v ...,x n ,y t ) + C(x, y, z) f(x v ...,x n ,z), 

<-l 

where A (x, t/ 9 £) 4= 0, 


9*2 
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Differentiating this equation with respect to x { , yp z, and substituting 
y i for y } , £ for z, we obtain the equations 

0 = y , £) y t ) + G(x, v , Q ~f(x, £), (*) 

A(x, V, C) fij- 9(y, 0 = B^x, y, £) ^ffa yj), (5) 

A(x,y,0~9(y,0 = C(x,y,Z)^f(x,Q. (6) 

rs 

Since A(x,y,£)^ 0 and ^9(y> £) 4= 0, it follows that C(x, y, £) =1= 0. 

0 

We know already that ^/(x, £) 4= 0. Now, since f(x, £) = 0, 

= B t (x, y, 0 ^-/,(x, y,) j C{x, y, £) ^ f(x , £) 

= ^2 £<(*, i /> 0 %) / j4 ( a: > 0 0 

[4 *»• o] *>/§>■ • 

using (4), (6), and (5) respectively. But this last expression is 

t Zhdy t dx { ' 

Hence we have proved 

Theorem I. // y v ...,y n is any minimal basis for an algebraic 
extension K* of K(x v x n ), and £ is any element of K*, then 

K = y K*!i 

dx t j-idyjdXi 

Since the particular basis x v ...,x n for K* has no special signi- 
ficance, this theorem gives a rule for determining the partial 
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derivatives of £ with respect to any base 9/ 1 , ...,y n when they are 
known for any other base x v ..., x n . In particular 


Hence we have a 

Corollary. The matrices 


dx * - x v 

i} fc-i dy k faj * 


dx. 


© - & 


are inverses of one another. 

The partial derivatives ( i = 1 , . . . , n) are defined only when 

the minimal basis rj v ...,ri n is given. If rj v ...,rj r are any elements 
of K* and £ is any other element of K* algebraically dependent 
on them, the irreducible polynomial f(y u ...,y r ,z) such that 

f(y,C)=f(yi>-,Vr,£) = o 


we 


may not be unique. If ^ = |j^/(2/> zljj,,-.,, i® not zero, and if 
define formally by the equation 

0 £^’ ® dy t + df/to' ® = °’ 

the proof of Th. I, which does not make use of the fact that 
r/ v are algebraically independent, shows that 

K = yK d Ji 

dx { j-ityj dx i ’ 


This result has some importance in practical problems. 

Now let x v ...,x n be any minimal basis for an algebraic function 
field, and let ij v ...,y r be r elements of the function field. The rxn 
matrix 

is called the Jacobian matrix of the functions i/ 1 , ..., T) r with respect 
to x lt ..., x n . When r = n the determinant of the Jacobian matrix 
is called, simply, the J acobian of r\ x , ...,rj n with respect to x v ...,x n , 
and is denoted by 

d(7i >—>Vn) 
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Theorem II. If (x v ...,x n ) is a minimal basis for an algebraic 
function field K*, and Vi>--->Vn are n dements of K*, a necessary and 
sufficient condition for i/ v ...,ri n to be algebraically independent is 

d(Vi>->Vn ) ■ 0 
d(x v ...,x n y 

(i) Suppose that i) v are algebraically dependent. Let 

f(y v ..., y n ) be a non-zero polynomial such that 

f(V ?») = o. 

At least one of the ..., y n is algebraically dependent on the rest. 

After rearrangement, let this be i) n . Then we may assume that 
f(Vi> •••> Vn-vVn) is a non-zero irreducible polynomial in 

K[V i> -,Vn-i\[y n l 

and since K(ij v ..., i/ n _ 1 ) is without characteristic, 

0 

fy-J(Vv~;Vn-vVn)±0- 

Also, f(rj v t] n ) is the zero of K*, and therefore 

0 


But 


y K d Jk = K = o a = i 

jZidrtjdXi 


dx { 


n), 


and, since not all dfjdrjj are zero, it follows that 


det 



= 9(^i, —,?n ) = 0 

d(x t ,...,x n ) 


(ii) Now suppose that rj v ...,i/ n are algebraically independent. 
Then we may take them to be a minimal basis for K*, and dxfdijj 
is uniquely defined. By the corollary to Th. I, 

d(?/i. •••>•>/,») dfr »n) = j 

d(x v ...,x n )d(rj 1 ,...,ii n ) 

d{Vu—>Vn) . 0 
x n y 


and therefore 
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More generally, we can prove 

Theorem III. If (x v ...,x n ) is a minimal basis for an algebraic 
function field K*, and rj v ..., rj r are r elements of K*, a necessary and 
sufficient condition that ij lt be algebraically independent is that 
the Jacobian matrix of with respect to x lt ..., x n be of rank r. 

Suppose that the Jacobian matrix is of rank r. By II, § 3, Th. IV, 
this matrix can be augmented to a regular nxn matrix, the elements 
in the part added being 

Kl (P = r+l,...,n;j = l,...,n), 

where (i r+1 , is a derangement of the set (r+ 1, Hence, 
by the above theorem, the elements 


Vi ,—>Vr> x i f+l >-> x in 

are algebraically independent. Therefore the subset ...,7j r is a 

set of algebraically independent elements. 

To prove that the condition is necessary, we assume that rj 1 ,...,rj r 

are algebraically independent. Then r^n, and by the Exchange 

Theorem for algebraic dependence [§ 3, Th. V], we can find n — r of 

the z’s, say , . . 

Vp = x i P {p = r+l,...,n), 

such that the set (i/ i , ...,rj n ) is a minimal basis for K*. Then 


dfai»— »7n) ■ 0 


and the rows of the matrix 



must be linearly independent. In particular, the first r rows are 
linearly independent, and the rank of the Jacobian matrix of 
i with respect to x v ...,x n is r. This completes the proof. 

Finally, we note the following theorem, which is an immediate 
consequence of the results proved in Th. I: 

Theorem IV. If K, .••,*„), (y v ■••,y n ), ( z v are three 

minimal bases of an algebraic function field K*, the Jacobians satisfy 
the relations 

d(gj> —,*n) = 3fri» •••> z n) 
d(x v ... ,x n ] ) d(y 1} ...,y n ) d(x lt 

Vn ) a? n ) = j 

d(x 1 ,...,x n )d(y 1 ,...,y n ) 
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8. Some useful theorems. We conclude this chapter with the 
proof of some theorems which are of use in later chapters. 

Theorem I. If K is any commutative field containing an infinite 
number of elements, and if 

H x \ >•••»*„) (i = l,...,r) 

are r non-zero polynomials in iffaq, ...,a;J, there exist elements 
c 1# ...,c n in K such that 

•••> c n)4 : 0 (i = 1, ...,r). 

We need only consider the case r = 1, for if 

r 

f (*!,...,*») = n 

i =1 

then ...,c n ) + 0 

implies ...,c n )4=0 (i=l,...,r). 

We prove the theorem for a single polynomial f>(x lt ...,x n ) by 
induction on If n = 1, let ^(ajj) be of degree m (m > 0), that is, 

0(*i) = d 0 a;f + d 1 xf- 1 + ...+d m (d 0 4=0). 

By § 4, there is an extension if* of K in which ^(aq) is completely 
reducible That is, 

#*i) = <*o(*l-£l)-(*l-£m)> 

where are uniquely determined elements in K*. Since 

K c K*, and K contains an infinite number of elements, there is an 
element <q in K not equal to any one of the set £ x , ..., £ m . Hence 

?H c i) = d 0 (ci — £ x ) • • • (c x — E, m ) 4= 0, 

since c x 4= £f, and there are no divisors of zero. 

We now assume that the th eorem is true when n — 1 indeterminates 
are adjoined to K. Let 

<f>(x v ...,x n ) 

= M x 1. — > x n-l) x n+<f>l( x V ■■■> x n-l) x n~ 1 + ••• +<f>m( x l, — >* n -l) 

be a non-zero polynomial in K[x v ...,x n ]. We may suppose that 
&oi x i» • • • * x n—i) 4 s 0. By hypothesis we can therefore find elements 
c x , . . . , C n _i of K such that <f> 0 (c v ...,c„_ 1 )4=0. Now apply the proof 
for the case n = 1 to the polynomial (f>(c v ...,c n _ v x n ), and the 
result follows. 
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Corollary. The theorem is true if K is any commutative field 
without characteristic. For K then contains the field of rational 
numbers, and therefore has an infinite number of elements. 

Theorem II. If K is a commutative field without characteristic 
which is algebraically closed, <f>(x) any polynomial in K\x\ such that 
<f)(0) 4= 0, and if r is a given positive integer, then there exists a poly- 
nomial f(x) such that [f(x)Y — x is divisible by <f>(x). 

Since K is algebraically closed we can write 

<f>(x) = a n (* - ii) 8i (i t in K), 

<=■ i 

where f x , ..., are assumed to be distinct. Let polynomials 
gi(x), ...,g k (x) be defined by the equations 

m - Sf [» - + 4; < ( * • - W • + ^ (*- i S<> 2 


+ 


(l-r)(l-2r)...(l- a< -2r) 



where denotes an element of K such that (Q lr ) r = £ t . Such 
elements exist, since K is algebraically closed. By direct calculation 
it may be seen that \gi(x)) r — x is divisible by (x—iff*. Now write 


<f>(x) = (x-iffifax). 


so that <f>i(x) does not contain x — £ t as a factor. By the division 
algorithm, there exists a pair of polynomials afx), b t (x) such that 

di(x) < Pi(^) J rb i (x)(x-i l ) i i = 1 . 


Then, using this identity, 

9i(x) = gj(x )a i(x)4> t (x) g t (x)b t (x) (x-jfft 
<f>(x) <j>(x) <f>(x) 


We may write 


9i( x ) a i( x ) , gj(x)bj(x) 

(x-OtY* M x ) 


gJx)Oi(x) 

( x ~£i) Si 


= hi(x) 4 




Afx) being a polynomial of degree less than s { , and then 


9i( x )bi(x) 

<f><( x ) 


-hi(x) + 


Bfx ) 

<t>ii x ) ' 
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Hence g { (x) = A^x) &(x) + B t (x) (x - £ f ) 8 <, 

and since [<7i(x)] r — x contains (x—E, { ) e ' as a factor, it follows that 
[.4 < (a;)0 i (x )] r — x contains (x— £*)*< as a factor. Now let 

/(x) = I,A i (x)<p i (x). 
i = 1 

Since Aj(x)$j(x) contains (x — ^) 8 * as a factor for all jAi, and 
[A i (x)^ i (x)] r — x contains (x — ^) s » as a factor, it follows that 
[f(x)T — x contains (x — ^) s « as a factor. By symmetry, this holds 
for all values of i from 1 to k, and therefore [/(x)] r — x contains 

k 

II (*-£<)*’ as a factor. But a is in K, and not zero. Therefore 
i 

[f(x)Y-x contains <j>(x) as a factor. This proves the theorem. 

It will be noted that the degree of f(x) is less than that of (j){x). 
But/(x) is not uniquely defined by the requirements of the theorem. 
We could, for instance, take 

k 

f(x) = 2 e { Ai(x) 0<(x) + r/r(x) <f>{x), 

<= l 

where e lt ...,e k are k elements of K such that e\ — 1, and ifr(x) is 
any polynomial. . 
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ALGEBRAIC EQUATIONS 


1. Introduction. In this chapter we shall be dealing with 
polynomials in several indeterminates, with coefficients in a field 
K. This field will be assumed to be commutative, and to contain 
an infinite number of elements. 

Let f(x v ...,x n ), g(x v ...,x n ), ... be a set of polynomials in 
K\x x , A set of elements (£ x , £ n ) of an extension K* of the 

ground field K is called a solution of the equations 


f(x 1 ,...,x n ) = 0/ 
g(x v ...,x n ) = 0, ' 


( 1 ) 


if 


/(£i> •••>£„) = o, 

9 (£ i > • £») = b, 


It will be observed that x v . . . , x n in ( 1 ) are not indeterminates, but 
unknowns, otherwise (1) would imply that f(x 1 , ...,x n ), ... were zero 
polynomials. Strictly speaking, we should use a different symbol 
for an unknown, but it will save a multiplication of our symbols if 
we use the same letters for unknowns and indeterminates. There 
will be no confusion provided that we observe the following con- 
ventions: if f(x v ...,x n ) is written alone, it is a polynomial in the 
indeterminates x lt ...,x n \ if we write 

f(x v ...,x n ) = 0 

the x v ...,x n are unknowns; and if we wish to express the fact that 
the polynomial f(x ly ...,x n ) is the zero polynomial, that is, that 
f(x v ...,x n ) is identically zero , we shall write 

f{x i,...,x n )= 0. 

The main purpose of this chapter is to find criteria for the exist- 
ence of a solution of a given set of equations. While we shall often 
discover solutions in a field K* which is a transcendental extension 
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of K, our reasoning will show that if a set of equations with coeffi- 
cients in a field K has a solution, the set necessarily has a solution 
in an algebraic extension of K. 

We shall be much concerned with homogeneous equations. The 
equations (1) are homogeneous equations if all the polynomials 
f(x v ...,x n ), g(x v ...,x n ), ... appearing in the equations are homo- 
geneous. The polynomial f(x x , ..., x n ) is homogeneous of degree r 
if, for an indeterminate A, 

f(Ax v . . . , A» n ) a A rf(x v ..., x n ). 


All terms in f(x v ...,x n ) then have the same total degree r. A 

homogeneous polynomial is usually called & form. 

If (£j, ..., f n ) is any solution of a set of homogeneous equations, 

so is (Af x , . . ., A£ n ), where A lies in any extension of K. Thus (0, . . ., 0) 

is always a solution, but we shall regard this as the trivial solution. 

We shall find it convenient to consider two solutions (£ 1; 

(%, . . . , rj n ) which are non-trivial as equivalent solutions if there exists 

a u such that r . . . 

U = Mi (* = !,...,»)• 


Any non-trivial solution of a set of homogeneous equations will be 
identified with all equivalent solutions. 

So far we have said nothing about the number of equations in a 
given set, and have not even assumed the set to be enumerable. 
We shall be concerned with the number of equations in our next 
section. We remark here that there are two ideals associated with 
a given set of equations. First, in the non-homogeneous case, let 
h(x 1} ..., x n ), k(x v . ..,x n ) be two polynomials which vanish for every 
solution (f 1; ...,£ n ) of (1). Then 


= % u = o. 


But if a(x v ..., x n ) is any polynomial in K[x v ..., xf\, the equations 


h(x lt ...,x n )-Jc(x v ...,x n ) = 0 and a(x v ...,x n )h(x lt ...,x n ) = 0 


are satisfied by (§ lf ..., f n ), since 

U = #, and a{£ 1 ,...,£ n )h{£ 1 ,...,£ n ) = 0. 

Therefore the polynomials h(x v ...,x n ), k(x v ...,x n ), ... form an 
ideal [I, §2]. Now consider the case in which the equations (1) are 
homogeneous. Let us write 

h(x v . . . , x n ) = hofa, ...,x n ) + h l (x 1 ,...,x n ) + ...+ h T (x v ...,x n ), 
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where hfx v ...,x n ) is homogeneous of degree i in x lt ...,x n . Since 
the set (1) is a set of homogeneous equations, if (£ x , ...,£ n ) satisfies 
h(x v ...,x n ) = 0, so does (A \ x , ..., Af n ). That is, 

^o(£l> • • • > £n) + 1> • • • > £«) + • • • + 1> • • • » £«) ~ 0* 

Since A may be taken as an indeterminate over K(£ lt ...,£„) it 
follows that 

U = ... = K(k, ...,£») = 0. 

Therefore every polynomial of the ideal is a sum of homogeneous 
polynomials, each of which belongs to the ideal. 

Again, suppose that we may write the equations (1) in the form 

A(*i,— ,*„) = o, 

fi( x l> •'•> x n) = 

• • • 

Consider the polynomials 

2« ( (*i *»)/<(*! 4 

i 

where the afx^ ...,x n ) are in K[x v ... ,x n ], and the various sum- 
mations are all over a finite number of values of i. It is clear that 
these polynomials form an ideal. Also, every polynomial of this 
ideal vanishes for every solution of the equations (1), and therefore 
belongs to the first ideal. In the case in which (1) consists of homo- 
geneous equations we may write 

Ui(x i, ...,x n )= a oi (x v . . ., x J + . . . + a rii (x v . . . , x n ), 

where a ji (x v x n ) is homogeneous of degree j, and hence 

■■■> x n)fi( x 1» ~> x n) ■•■> x n)fi( x l> ■■; x n)> 

i i 

where each term in the summation is homogeneous and belongs 
to the ideal. 

The two ideals we have associated with the set of equations (1) 
do not necessarily coincide. To take a very simple example, let 
the set (1) be simply 2 _ 

— U, 

Then the polynomial x x lies in the first ideal, but not in the 
second. 
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2. Hilbert’s basis theorem. We now show that in considering 
sets of equations, homogeneous or not, it is sufficient to confine 
ourselves to sets containing a finite number of equations. We first 
consider non-homogeneous equations, and then deduce the result 
for homogeneous equations. 

Theorem I. In any non-vacuous ideal i in K[x v ...,£ n ] there 
exists a finite number of polynomials gfix v ...,*„) (j = 1, ...,r) such 
that any polynomial F(x v ..., x n ) in i can be written in the form 

r 

F(x i, . • • , x n ) s 2 a { (x v ...,x n ) gi(x v . . . , x n ), 

i-1 

where the afix^ ...,x n ) are in K\x v ..., «„]. 

Such a set of polynomials g t {x v ...,x n ) is said to form a basis for 
the ideal i. It is evident, conversely, that if the g t (x v ...,x n ) are in i, 
and the a i ( x v ...,x n ) in K[x v ...,x n ], then F(x 1 , ...,x n ) is in i. 

We prove the theorem by induction on n. If n = 0, the result is 
trivial. In this case K\x x , ...,a : n ] = K. If i contains only the zero 
of K there is nothing to prove. If i contains a non-zero element a 
of K it contains any other element ft of K, since ft = {ftor 1 ) a. 
Hence, to prove the theorem, we need only take g x = a. 

Now suppose that the theorem has been proved for ideals in 
K[x v We write the elements of i in K[x v ...,x n ] as poly- 

nomials in x n , with coefficients in K[x v ..., x n _ft[. We consider the 
coefficients of the highest powers of x n in the various polynomials, 
that is, the leading terms of each polynomial. We write 

F(x i, . . . , x n ) =Mx v . . . , x n _f) x a n +f 2 (x v x^- 1 + . . . , 

and prove that the polynomials f 1 (x v ...,a; n _i) form an ideal in 
K[x x , ■■■ > x n _ x \. To prove this, let 

0(x i, • • • , x n ) s g x (x v . . . , x n _ x ) xi + g 2 {x x , . . . , x n _ x ) x^- 1 + . . . 

be another polynomial in i, and suppose that ft. Then 

F(x v ...,x n )~ xl~fi 0(x 1 , ...,x n ) 

= •••> »«-l) “ gi(Xl, ■> »n-l)] < + • • • 

is a polynomial in i with leading coefficient 

fftx x , •■■,x n _f) — g 1 (x v ...,x n _ x ). 
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Also, if a(x v is an y element of K[x v •••> x n-i\> then 

a{x v ■■■, x n -i) F( x v -.;X n )=a(x v ..., x n _ x )f x (x v x ■ n _ 1 ) a% + ... 
is a polynomial in i whose leading coefficient is 
a(x 1 ,...,x n _ 1 )f 1 (x v ...,x n _ 1 ). 

Hence, from the definition of an ideal, we deduce that the leading 
coefficients of polynomials in i, together with zero, form an ideal in 
K[ x v By hypothesis, there exists in this ideal j a finite 

set of polynomials 

•••> x n- 1 ) (i = 1, 

such that any element of j can be written as the sum 

r„ 

i, •“> x n—l) * * * > X n — l ) j 

i = 1 


where the a i ( x v ...,x n _ 1 ) are elements of K[x v ...,x n _ 1 ]. Since 
$i(x v • ••,x n _^) is in j, there is a polynomial g i (x 1 , ...,x n ) of i which, 
considered as a polynomial in x n , has x v ..., x n _j) for its leading 
coefficient. Select polynomials g i (x x , ...,x n ) corresponding to the 
r 0 values of i considered, and let N t be the degree of g t (x v x n ) 


in x n . Let 


N = max {N v ...,N ro ). 


Then if F( x v x n ) =f{x v x n _ x ) ** + ... 

is any element of i, of degree oc^N,we can write 

n 

f( x v a S «<(*!, •••»«„-! )H X V 

U 

and F(x v . ... * n ) - 2 a { (x v x n _ x ) x^tg^, ...,x n ) 

i = 1 

is a polynomial in i of degree less than a. Repeating this argument 
at most a — N further times, we show that there exist polynomials 
b { (x v ...,z n ) such that 

r. 

F{x v S H x V -> x n)9i( x V-, x n) = M x l, •••»«») 

i-1 

is a polynomial in * of degree less than N. 

We therefore consider the polynomials in i of degree less than N 



144 


IV. ALGEBRAIC EQUATIONS 

in x n . We show that the coefficients of a^ -1 in these polynomials 
form an ideal j t in K[x x , . ..,x n _y]. Let 

h{x v = h x (x x , . . . , x n _j) x%-' + . . . , 

and k(x v ...,x n ) = k x ( x x , x n _ x ) a^" 1 + . . . . 

Then 

h(x v ...,x n )-k(x v ...,x n ] ) 

= [^(ajj, a: n _i) k x (x x , ...,x n _-^j\x^ 1 + 

and 

a{x v ...,x n _ 1 )h{x 1 ,..., x n ) 

= Cl(x i, • • • , X n _ x ) h x (x x , • ••,a: n _ 1 )a^' _1 + ••• 

are polynomials in * of degree less than N, and their leading co- 
efficients are 

hi( x v ...,x n _ i) k x {x x , x n _ x ), o>(x x , ...,x n _ x )h x ( x x , ... ,x n _ 1 ), 
where a(x v ...,x n _ x ) is an element of i£'[a: 1 ,'...,a; n _ 1 ]. The leading 
coefficients therefore form an ideal j v which, by hypothesis, has 
a basis . . , . . , 

Vr„+l\ x V •"> x n-V> •••> Yn( x l> ■••> x n-l)‘ 

If the polynomials of degree iV - — 1 whose leading coefficients are 
these <p ro+i (x x , ...,x n _ x ) are 

9r 0 +i( X l> • ■ • ■ • , X n) = K+i( X 1> • ■ • ■ • . X n-l) + • • • , 

we can write any element of i of degree N — 1 in the form 
h{x v . . . , x n ) s h x (x x , x n _ x ) x% _1 + ... 

ri-r 0 

= S «r 0 +i(«l x n-l )<f>r 0+ i( x l x n-l) + • • •, 

l s=s 1 

and therefore 

r,-r, 

h{x x ,...,X n )~ S ®r„+f(®l> ’ • ' > x n—l) 9r 0 +ii x V •••> x n) 
i ~=1 

is an element of i of degree N — 2 at most. Repeating this process 
N — 2 times at most, we obtain a finite number of polynomials 
g i {x v ...,x n ) (i = 1, ...,s) in i such that for suitable a i (x 1 , ...,x n ) in 
K[x v ..., a: n ] we find that 

8 

F{x x , ...,X n )-S«i(* x, ..., x n)9i( x V —> X n) 

i=l 

is a polynomial of degree zero, and a further application of the 
hypothesis of induction shows that 

r 

F(x x , . . . , x n ) a 2 a t ( x x , ...,x n )g i (x x ,...,x n ). 

i-i 

This proves Hilbert’s basis theorem. 
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Now suppose that we are given an enumerable sequence of 
equations 


f j (x 1 ,...,x n ) = 0 (j = 1,2,...). 
We have seen that the polynomials 

Za j (x v ...,x n )f j (x 1 ,...,x n ), 


( 1 ) 


where the summation is over a finite number of values of j, form 
an ideal i. This has a finite basis consisting, say, of polynomials 

g k {x v ...,x n ) (k = l, 

Since each of these polynomials lies in i, 

g k (x v ...,x n )=Za kj (x v ..., x n )f. )(x v ...,x n ), 

where the summation is over a finite number of values of j. There 
exists, therefore, a finite number s such that 


8 

*,)/j(*i (fc= l,...,r). 

i = l 

Now, any polynomial of the sequence (1), say f t (x v ...,x n ), is expres- 
sible in terms of the g k (x v that is, 

r 

ft( x v ••• 9 x n ) s X b tk {x v ...,x n )g k (x i, ...,x n ) 

k= 1 
8 

= S C tj( X V • x n)fj( x 1> • • • > x n)> (2) 

i “1 

r 

where c (j (x v ...,x v ) = S •••.«»)• 

* = i 


From (2) we see that in the sequence of equations (1) we may con- 
sider only the first s of them, the remaining polynomials fj(x v ...,x n ) 
belonging to the ideal defined by the first s, and vanishing at all 
common zeros of these s polynomials. 

The argument given above holds whether the set (1) is homo- 
geneous or non-homogeneous. 

Another application of the basis theorem is to the ideal defined 
by polynomials which vanish for every solution of a given set of 
equations. Let gj (x v ...,x n ) (j= l,...,r) , 

be a basis for this ideal. Then 

g j (x 1 ,...,x n ) = 0 (j = 1, ...,r) (3) 

is satisfied by all the solutions of the given set of equations. If 

f(x l ,...,x n ) = 0 


HP I 


TO 



146 


IV. ALGEBRAIC EQUATIONS 


is any equation of the set, f(x v ...,x n ) is in the ideal we are con- 
sidering, and therefore 

r 

f(x v . . . , x n ) = S •••>*„) • ••,**)• 

i -1 

Hence every solution of the set (3) is a solution of the set of equations 
considered. The two sets of solutions therefore coincide, and we may 
replace the set of equations by the finite set (3). 

If the given set of equations is homogeneous we can replace it 
by a finite set of homogeneous equations. For we saw in § 1 that if 
we write 

9j{x i. •••»*«) = g oj (x i, g n /x 1} ...,x n ), 

where g^(x v . . . , x n ) is homogeneous and of degree i, the homogeneous 
polynomials (or forms) g i} (x v ...,x n ) belong to the ideal. Hence, the 
set of homogeneous equations can be replaced by the finite set of 
homogeneous equations 

■••’ x n) = 0 (* = 1» —>»*; j = 1, 


3. The resultant of two binary forms. The results of the 
previous section enable us to confine our study of systems of alge- 
braic equations to finite systems. We proceed to investigate the 
conditions that a system of equations may have a solution. 

While we are ultimately concerned with equations whose co- 
efficients lie in the given ground field K, it is essential to consider, 
as part of our method, equations with coefficients which may belong 
to an extension K' (possibly transcendental) of K. If the coefficients 
of the various equations in the system are a,/?, ...,£, the elements 
of K' obtained by applying the operations of addition and multi- 
plication to a, 8 and to the elements of K lie in an integral 
domain which is a subring of K ' . We call this subring of K' the ring 
of the coefficients, and its quotient field is the field of the coefficients. 

An extreme case is that in which the coefficients are all indepen- 
dent indeterminates over K. The ring of the coefficients is then 
K[a,fi, ...,<?], while the field of the coefficients is K(a,fi, ...,8), the 
field of rational functions of these indeterminates. The system of 
equations will be called a system with indeterminate coefficients in 
this case. 


The idea of the specialisation of polynomials has been described 
ini, §5. Let = 0 (* = !,...,*) (1) 


be a set of equations with indeterminate coefficients. We obtain a 
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set of polynomials in the coefficients (that is, elements in the ring 
of coefficients) which has the following property: a necessary and 
sufficient condition that the set of s equations 

ff{x v ...,x n ) = 0 (i = 1, (2) 

obtained from (1) by the specialisations 

fi(x 1 ,...,x n )-+ff(x v ...,x n ) 

should have a solution is that the corresponding specialisations of 
the set of polynomials all vanish. 

As a first step in our reasoning, and as an illustration of the 
method, we consider the case of a pair of homogeneous equations 
in two unknowns, 

f(x 0 ,x 1 ) = 0, g{x 0 , a; 1 ) = 0. 

In the case of a single non-homogeneous equation 

f{x) = 0 

it has already been shown [III, §4] that there is an algebraic 
extension of the field of coefficients in which f(x) is completely 
reducible, so that 

f{x) = a(x — aj ... (x — a m ). 


If K* is any other extension of the field of coefficients in which f(x) 
is completely reducible, and if 

/(*) = a{x — ... (a; — £ m ), 

we know that K(£ v . . . , £ m ) and K(a v ...,a m ) are equivalent exten- 
sions of K, in which and a t correspond if the factors are suitably 
arranged. For our present purpose we need not distinguish between 
f* and a i . Hence we can say that the only solutions of f(x) = 0 are 

*1, • ■ • > ®nr 

If f(x 0 , x^) is a form, that is, a homogeneous polynomial of degree 
to, we can write 

f{x 0 , xj s a(/? 1 .r 1 - a^o) . . . {P m x x - a m x 0 ) 

in some algebraic extension of the field of coefficients, and the only 
solutions, in the homogeneous sense, of fix^xj = 0 are 

{fill ^l)» •••» {fimf^m)’ 


10-2 
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Now consider a pair of forms in x 0 , x x (usually called binary 
forms) in which all the coefficients are indeterminates over K: 

f{x 0 , x x ) ==a 0 x? + a x + . . . + a m xtf,\ 

g( x o> x i)= b o x i +b 1 x?~ 1 x 0 +...+b n x%. 

Then 

a 0 xj , + n “ 1 + . . . + a m x^~ 1 Xo l = x i~V( x o> x i)> 

a 0 x?+ n ~*x 0 + ...+ a m x?~*x% i + 1 = x?-*x 0 f{x 0 , x x ), 

*•••••* 

Oo^f^o -1 + — + “mC " 1 = x 0 ~V( x o> x l)> 
box?*"- 1 + . . . + b n x?-'x% = x?-ig(x 0 , x x ), 

b 0 x™ +n ~ i x () + ... + b n x x ~ i XQ +1 = x i~* x og{ x o> x J, 


b 0 x?x%- 1 + ... +b n xy+ n ~ 1 =x^~ 1 g(x 0 ,x 1 ). 
If R is the determinant of the square matrix 



the multiplication of these m + n equations in order by the corre- 
sponding cofactors of the successive elements in a column of the 
matrix (4) gives us the identity 

Ro^xl = A rs {x 0 , xj/fa, x x ) + BJx 0 , x x ) g(x 0 , x x ), (5) 

where A rs (x 0 , ^i) an d B rs (x 0 , x i) are forms of degrees n— 1 and m — 1 
respectively whose coefficients are in the ring K[a 0 , a m , b 0 , . . . , 6 n ]. 
There is one such equation for each pair of values of r, s such that 
r + s = m + n— 1. In particular, 

Hx^+n-i = A{x 0 , x x )f(x 0 , x x ) + B(x 0 , x x ) g(x 0 , x x ), (6) 


when r = 0, s = m + n— 1. 
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Now consider any specialisation in K’\_x 9 ,x^\ of the forms (3), 
where K' is any extension of K. If, for this specialisation, R is not 
zero, the specialised equations 

f( x o> x i) = °> 9( x o’ x i) = 0 

can have no solution. Suppose, indeed, that they have the solution 
(c 0 , c x ), in which c 0 , say, is not zero. From (6) we have the equation 

1 = A(c 0 , C x )/(c 0 , C x ) + B(c 0 , C x ) </(c 0 , C x ) 

= 0 , 

which is a contradiction. 

Suppose, on the other hand, that for some specialisation of the 
forms (3) the determinant R is zero. The rows of the matrix (4), 
considered as vectors, are then linearly dependent. Hence we can 
find elements , , , 

®0> • • • > ®n-l> c 0> c l> • • • » c m— 1 

in K ' , not all zero, such that 

d 0 u 0 + d 1 u 1 + ... +d n _ 1 'M n _ 1 — c 0 v 0 — c x v x — ... = 0, 

where the vectors u 0 , « n _ x , v 0 , v rn _ 1 are the row-vectors of the 
matrix (4). We therefore have the set of equations 

d 0 a 0 -c 0 b 0 = 

<2 0 a x + d x a 0 Cq&i c x 6q — 0, 
dn—l^m c m—l^n ~ 

If d(x 0 , £ x ) = d 0 x x 1 4- . . . + d n _ j^Xq 1 , 

and c(x 0 , x x ) = CqX?- 1 + . . . + c rn _ x a?S* _:l , 

the equations (7) express the fact that 

d{x o, x x )f(x 0 , z x ) = c(x 0 , z x ) g{x 0 , x x ), (8) 

being merely the equations which express the equality on both sides 
of (8) of the coefficients of *f +n_1 , ...,a# l+n-1 . 

Now resolve both sides of (8) into irreducible factors in K’[x 0 , * x ]. 
By the unique factorisation theorem [I, §7], any irreducible factor 
on one side must also appear on the other. Since d(x 0 , a; x ) is of degree 
n— 1, at most, one of the factors of g(x 0 , x x ) must be a factor of 
f(x 0 , » x ). Hence if R = 0 ,f{x 0 , « x ) and g(x 0 , x x ) have a common factor 




150 


IV. ALGEBRAIC EQUATIONS 


of degree greater than zero. Any solution of the equation obtained 
by equating this common factor to zero is a solution of the equations 


f(x 0 , Xj) = 0, g(x o,^) = 0. 


The common factor can be found in K'[x 0 ,x J by the division 
algorithm. The common solution of the two equations is in an 
algebraic extension of K'. 

Calling R the resultant of the two binary forms (or equations), 
we have proved 


Theorem I. A necessary and sufficient condition that two homo- 
geneous equations with coefficients in afield K' have a common solution 
is that their resultant vanishes. The common solution, when it exists, 
lies in an algebraic extension of K ' . 

We may use this result to obtain a theorem for two polynomials 

in K'\x\. Let , 

f(x) = a 0 x m + ... + a m , 


g{x) = b 0 x n + ... + b n . 


Make these polynomials homogeneous by introducing a new in- 
determinate y; we obtain the forms 


f(x, y) = a 0 x m +... + a m y m , 

g(x,y)==b 0 x n + ...+b n y n . 

Let R be the resultant of these two forms. If i?=t= 0, the equations 

f(x) = 0, g(x) = 0 (9) 

cannot have a solution. For if they had a solution x — c, the 
homogeneous equations 

f(x,y) = 0, g(x,y)=*0 

would have the solution (c, 1). 

If R = 0, the homogeneous equations have a solution. Let this 
be (c, d). If d 4=0, this solution is equivalent to (cd~ x , 1 ), and x = cd~ x 
is a solution of (9). But this reasoning fails if d = 0. In this case 
substitution in f{x, y) and g(x, y) shows that a 0 — 0, b 0 = 0. The 
determinantal expression for R makes it evident that if a 0 = b 0 = 0, 
then R = 0. Hence we have 

Theorem II. A necessary condition for the existence of a solution 
of two non-homogeneous equations is R = 0. It is not a sufficient 
condition unless we impose the additional condition that the leading 
coefficients of the equations are not both zero. 



3. THE RESULTANT OF TWO BINARY FORMS 151 


We deduce from (6) that the resultant of two polynomials f(x) 
and g(x) can always be expressed in the form 

B=A(x)f(x) + B(x)g(x), (10) 

where both A(x) and B(x) are polynomials in x with coefficients in 
the ring of coefficients of f(x) and g(x). 

We conclude this section by indicating how an alternative 
approach to the foregoing argument can be extended to give a set 
of conditions to be satisfied when the binary forms f(x 0 , x x ) and 
g(x 0 , Xj) in (3) have a common factor which is at least of degree t. 
If this is the case, 



f{x 0 , x x ) =f x ( x 0 , x x ) d(x 0 , a^), 

and 

g(x 0 , x x ) s g x (x 0 , x x ) d(x 0 , x x ). 

where 

fi(x o, x i) = d 0 xr l + • • • + djn^tX™" 1 , 


9l( x 0> x l) = Co x l + — + c n-t x 0 ~ l - 

Since 

9i( x o> x i)f( x o> *i) =/i(«o. *i) 9( x o. x i). 


we have the equations 

c 0 a 0 — d 0 b 0 = 0, 

C 0 a l + C X°0 ~ ^ 0^1 ~ ^ 1^0 = . 

d m _ t b n = 0. 


( 11 ) 


Hence the rows of the (to + n — 2t + 2) x (to + n — t + 1) matrix 
l a o a l a m 

0 a 0 a i • a m 

. . . 0Q Otj 

b 0 b x . . b n 

0 b 0 b, • . b n 

\ • • • • • b n l J 

are linearly dependent. Its rank is therefore not greater than 
to + w — 2t+ 1. Conversely, if the rank of this matrix is less than 
to +7i — 2t + 2, the rows are linearly dependent, and we can find 
elements d 0 d m _ l , c 0 , not all zero, in the field of the 


°\ 

0 \ 


n — 1 + 1 
rows 


a„ 


( 12 ) 


TO — 1 + 1 

rows 
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coefficients such that (11) is true. Defining ffx^ xf) and g^x^Xj) 
as above, it follows that 

9i( x o> %i)f(Xo, * 1 ) =/i(*o> x i) d( x o> x i)> 

and from the unique factorisation theorem we deduce that/^,^) 
and g\x 0 , x x ) have a common factor which is of degree t at least. 
Hence 

Theorem III. A necessary and sufficient condition that f(x 0 , x x ) 
and g(x 0 , xf) have in common a factor of degree t at least is that all the 
determinants ofm + n — 2t + 2 rows and columns in ( 1 2) should be zero. 

4. Some properties of the resultant. Let a v ff v ...,/? n 

b e m + n indeterminates over the ground field K, and let c v ...,c m , 
d v ...,d n be the elementary symmetric functions of these two sets. 
That is, let 

C 1 — a l+ ••• + a m> ^1 = Pl+-‘-+Pn> 

c 2 = a x a 2 + . . . + a m _ x a. m , d 2 = ff 1 ff 2 + ...+ P n „J n , 

• • • • • • • 

c m = a l a 2 •** a m» *n = P\P%—P»' 

Let a 0 , b 0 be two further indeterminates, and write 

a i = (-l) i a 0 c i (i = 1, ...,m), 
b i = (~l) i b 0 d i (i = 1, ...,n). 

If f(x) = a 0 x m + a 1 x m ~ 1 + ...+a m) 

and g(x) = b 0 x n +6 1 x n " 1 + ... +b n , 

then f(x) = a 0 (x m — c 1 x m ~ 1 + c 2 x m ~ 2 + ... + ( — l) m c m ) 

m 

= n ~ a i)> 

i-l 

n 

and g( x ) = b 0 U( x -Pj)> 

i - 1 

Let R be the resultant of f(x) and g(x). Then R is an element of 
K[a 0 , ...,a m , 6 0 , and we express this by writing 

R — RifliQ, ...,a m , bf), ...,b n ). 

The determinantal expression for the resultant shows that it is 
homogeneous, of degree n, in a 0 , . . . , a m , and homogeneous, of degree 
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m, in b 0 , . . . , b n . For every term in R contains one and only one element 
from every row, and one and only one element from every column 
of the determinant. The principal diagonal of the determinant gives 
the term aft b% as one term in R. We may write, by the above remark, 

K ^») = a 0 K 1 > • • •> i f • • •> t ^n)' 

Now, i?(l, — c v ..., ±c m , 1 , — d v ...,±d n ) is an element of 

K\ct i, ..., ot m , /?|, • • • , /?«], 

and may therefore be regarded as a polynomial in a iy with coefficients 
in This polynomial vanishes 

for each of the specialisations 

since f(x) and g(x) then have a common factor. Hence, since the 
ft 1} ...,/3 n are all distinct, a factor of the polynomial is 

This is true for each value of i, and therefore 

R( l,-c v ...,±c m , 1 ,-d v ...,±d n ), 
regarded as a polynomial in K[oc lt ...,/? n ], contains 

m n 

& = n n (oti-Pi) 

i ■» 1 j ■» 1 

as a factor; or R(a 0 , b 0 , ..., 6 n ) contains 

m n 

s = «n n («« -A) (i) 

<=ij=i 

n 

as a factor. But g{ a i) = b 0 n (a* — fij), 

m m n 

and therefore n r A a i) = KU II ~ /?;)> 

i — \ i =1 i-1 

m 

that is, S = < n ( 2 ) 

i- 1 

Similarly S = ( - 1 b% Tlfifij), (3) 

m m 

f(x) = a 0 n (*-a t ) = ( - l) m a 0 n (a*-*). 

i =1 


since 
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Now (2) can be expanded as a homogeneous polynomial of degree 
m in the coefficients b 0 ,...,b n of g(x). The coefficient of each term is 
a* x(a symmetric function of a lt ..., a m ). Hence [III, §4, Th. V] 
S is a form of degree m in b 0 , ...,b n , with coefficients which are 
polynomials in a 0 ,c v c m . We deduce from (3) that S can also be 
expressed as a form of degree n in a 0 , ..., a m with coefficients which 
are polynomials in b 0 ,d v d n . Comparing these two results, we 
see that S is an element of K[a 0 , b 0 , ..., 6 n ] which is homo- 

geneous of degree n in a 0 , ..., a m , and homogeneous of degree m in 
6 0 , ..., b n . It also follows from (2) that one term in S is a%b%. 

We have already noted that 

■®(®o» ~ ®o c ij •••> ± a o c m> bo, ~bo^i, •••> ± b 0 d n ), 
regarded as a polynomial in a 0 , b 0 , a v ...,a m , fi lf contains S 

as a factor. That is, 

•*») i ^o^m, bo, Mi> * . *, i bQd n ) ST , 

where T is a polynomial in a 0 , b 0 , a v ..., a m , p v ..., /?„ which must be 
symmetric in a x , ..., a ro and in Hence T can be expressed 

as a polynomial in a 0 , b 0 , c v c m , d v ...,d n . By comparing degrees 
on each side and remembering [III, §4, Th. VIII] that a 0 , b 0 , 
c v ...,c m , d v ...,d n are algebraically independent over K, we see 
that T is in K. Hence, expressing S as a polynomial in a 0> a m , 
b 0 , ...,b n , we have a relation 

-RK.«1» •••> ®m> b 0 , * * * > bfi) ST 

between the a t and b } which must be an identity. By considering 
the coefficient of a^b™ on each side we find that T = 1. Hence 

m n m n 

* = < n g(* { ) = (-i) mn bp n/w = <b% n nK-A). w 

i — i / = i 

Now consider any speciahsation 

bj->/i] 

of the coefficients of f(x) and g(x), and let the roots of the specialised 
polynomials be ..., £ m , tj x , ..., i \ n . Then, since the specialisations 

*<-►&, p r a 0 ->A 0 , b 0 -+fi o 
imply the specialisations 

we obtain the equation 

771 71 

R{A 0 ,...,A m ,To,--;^n) = n IKZi-ViY 

i =» 1 J-l 
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We show next that R is isobaric in the o i and b s , of weight mn . 
In other words, if caft a^ ... a% b{’ ...tf* is any term of 


then 


b 0 , •••,b n ) } 

m n 

S r h + E S js — mn - 
0 0 


so that 


To prove this, consider the expressions for R given in ( 4 ). If in 
these we substitute aj = Aa i( /?j = A/^, where A is an indeterminate, 
we obtain 

R = <b% n n K-ft) = a *- oj6g* n n k-A), 

i j _ i j 

that is, I? = A mn R, 

where R is the resultant of the polynomials 

f{x) = a 0 U(x- aD = a 0 U(x- A a*), 

i i 

m = b 0 U(x-P' j ) = b 0 U(x-\P j ). 

j ) 

But these polynomials may also be written in the form 
f(x) =a 0 x m + + A 2 a 2 x m ~ 2 + . . . + A m a m , 

g(x) = b 0 x n +Xb 1 x n ~ 1 +X 2 b 2 x n ~ 2 + ...+X n b n , 

R = R{a o> ^®i> • • • > A m & m , b 0 , A6j, ...,A n b n ) 

= X mn R(a 0 a m , b 0 , ...,b n ). 

This proves the theorem. 

In later applications we shall need to consider the case in which 

the coefficients a^bj are forms in an integral domain K [£ lt ..., £ s ], 

obtained by adjoining the indeterminates £ 1( ..., £ g to K. We suppose 

that the degree of a t is h + i, where h is constant, and that the degree 

of bj is k +j, where k is constant. Since R is homogeneous, of degree 

n, in the a if and homogeneous, of degree m, in the b }) the degree 

of a term > 

ca l 0 ' ...a%bl’ ...b ] n * 

m n 

in R is E(A+0h+E(fc+0i/ 

<-o t -o 

m n m n 

— h 2 it + k 2 jt + S t*i + S tjt 

t - o t —o t -0 t-o 

= hn + km + mn, 

R being isobaric, of weight mn. Hence R is a form in K[£ v ...,£*] 
of degree nh + mk + mn. 
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S. The resultant of a system of binary equations. We consider 
the system of r homogeneous equations 

M x o> x i) = ° (1) 

and suppose, in the first place, that the coefficients are independent 
indeterminates over K. Let fi(x 0 , be of degree m i , and let 
m = max \m v ..., m r ]. Then if 

&(*<). «i) = x i) ,j (i= lf ' r)> 

0r+A x <» X l) = b i x l~ mi fi( x 0> X l) J 

where the a { , b { are new indeterminates, we consider the auxiliary 
system of equations 

<f>i( x o> x i) = 0 (i = (2) 

Clearly if, for any specialisation of the coefficients in an extension 
K' of K, the system (1) has a solution, so has the system (2). Con- 
versely, if the system (2) has the solution (c, d), so that 

a i c m ~ m 'f l (c,d) = 0,1 

= 0,1 

then, since a { , b i are not zero, and c, d are not both zero, it follows that 
fi(c,d) = 0 (i — 1, ...,r), 

and therefore the system (1) has a solution. Thus, in order to find 
conditions for a solution of (1) when the coefficients are specialised 
in K', we merely need to find the conditions for a solution of (2) 
under the same specialisations. We do this by combining the system 
(2) into two equations. 

Let %!,..., u 2r , v v ..., v 2r be 4 r further indeterminates, and let 

2 r 

®( x 0 > x i) = 'Lu i (/> { {x 0 ,x 1 ), 

1 

2 r 
i 

If R(u, v) is the resultant of this pair of binary forms O(x 0 , x x ) and 
W(x Q ,x^), we first note that R(u,v) is a polynomial in u lt u 2r , 
v v ...,v 2r . The coefficient of 

*4' ••• u\"v [ i ... v{" 

is evidently of the form 

D g — d s a { l+il ... a? +ir b[ r+l + ir+l . . . hi' r+iw , 
where d t is a polynomial in the coefficients of the original forms 
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fi( x o> x i)- These polynomials d v ...,d N constitute a resultant system 
of the set of homogeneous equations (1). 

If, for any specialisation of the coefficients, the system (1) has a 
solution, so has the system (2), and hence also the pair of equations 

^(Vi) = 0, ^(x^Xj) = 0. 

Then E(u, v) = 0. This implies that every coefficient D 8 vanishes, 
and therefore, since the a it bj are indeterminates, 

• rfj — 0, dfl — 0. 

Conversely, if each d s vanishes, the forms 
0(x o , x x ), ^P{x 0 , x x ) 

have a common factor. This can be determined by the division 
algorithm [I, § 6], and it is therefore an element of 

K'ia^bpU^vJix o.aq]. 

We may assume that the common factor is in K'[a it bp u 8 , v,] [x 0 , x{\ 
and is not divisible by an element of K'[a it b } , u 8 , vj. Now, 0{x o , aq) 
does not contain v x , ...,v 2r , so that this common factor does not 
contain any Vp and W{Xg, aq) does not contain u v . . . , u 2r , and so the 
common factor is independent of the u t . Hence every common 
factor of 0 (x 0 , aq) and ^(Xg, aq) is independent of u { , Vp and must 
therefore be a common factor of the forms 

(® = 1, ..., 2r). 

The system (2), and hence the system (1), therefore has a solution. 
From § 3, (5), we may write 

R{u,v)x r gx{ = a r8 {xg,x 1 )0(xg,x 1 ) + b rs {xg,x 1 ) ^(a^aq), (3) 

where r + s = 2m — 1, and a r8 (x 0 , aq), b r8 ( x 0 ,x x ) are forms in x 0 , aq 
whose coefficients are polynomials in the coefficients of 0(pc o ,x x ) 
and W{Xg,x^). Remembering that 0(x o ,x 1 ) and W(x 0 ,x x ) are linear 
expressions in the 4 > i( x o> x i)> an d how these are obtained from the 
original forms /*(»„, aq), we obtain, on equating like powers of 
u { , Vp a s , b t on both sides of (3), 

r 

diX'gX 8 !^ 2 b f8>ij (x o, xJfjiXg, aq). (4) 

3 =1 

As in the case of two forms, we can deduce from these results 
others applicable to a system of r equations in one unknown. As 
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before, we make the equations homogeneous, and construct the 
resultant system. Then we show that for specialised coefficients 
the members of the resultant system vanish if the given equations 
have a common solution, but that if the resultant system vanishes 
the equations either have a common solution, or the leading co- 
efficient in each of the equations vanishes. We have proved the 
following result: 

Given r polynomials fi(x), ...,f r (x) in one indeterminate with in- 
determinate coefficients there exists a set d v ...,d N of polynomials 
in the coefficients with the property that for specialisations of 
the coefficients the conditions d t = 0, ...,d N = 0 are necessary and 
sufficient to ensure that (i) the equations fi(x) — 0, ...,f r (x) = 0 are 
soluble in some extension field ; or (ii) the leading coefficients inf^x), ..., 
f r (x) all vanish. Moreover, if the equations have a solution, they have 
a solution in an algebraic extension of the field of coefficients. 

From (4) above we derive important expressions for d v ...,d N . 
Putting s = 0, we find that 

dt^ib^Wx). (5) 

i-i 


Finally, we extend the result proved at the end of §4 for two 
forms f(x 0 , xf), g(x 0 , xf) whose coefficients are themselves forms in 
K[E, ly We consider r forms 

M x o> x i) (* = 1 ,. 

and suppose that the coefficient of x\x{ in f k {x 0 ,xf) is a form in 

Z[g 0 , ..., g g ] of degree h k +j. If h = max [h v ..., hf\, then in the 

preparatory stage of finding a resultant system for the ff,x 0 , Xj) we 

choose a iy 6,- to be forms in g l9 . . ., g s , with indeterminate coefficients, 

of degrees h — hi + m — and h — h i respectively. At this stage our 

forms are , . . . 

<t>A x o. x i) = a i x o- mt fi( x o> x i)> 

<f>r+i( X 0> x l) = b i x ?~ mi M x O, x l)- 


The coefficient of x[ +m ~ mk xl in 4> k { x o> * 1 ) is a form of degree h+j in 
gj, . . . , g g . Hence we may apply the result of § 4 to 

2 r 

®( x 0 ,x 1 ) = 'Lu k (f> k (x 0 ,x 1 ), 

1 


2 r 


l P{x 0 ,x 1 ) = 'Zv k <f) k (x 0 ,x 1 ). 

1 


and 
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The coefficient of . . . uig v{ i ... vfy; in R{u, v), which is 

D t - d t a[ 1+ h ...aj r+i 'b{ r+l+i,+l ... b 1 "**" 

is therefore a form of degree 2 mh+m 2 in £ x , Since the a { , b } 

are forms in £ x , ..., g 4 , it follows that the d t are forms in £ v . . . , 


6 . The resultant system for a set of homogeneous equations in 
several unknowns. We now consider r homogeneous equations in 
7H-1 unknowns 


•••»*»)“ 0 (* = 1, ■■;?), 


(1) 


again beginning with the case in which all the coefficients are in- 
determinates. Let £ 0 , £ x be two further indeterminates, and write 


- Zo x iA = 0 

1 I 


Xf 


71 - 1 ). 


We then obtain from (1) a set of equations, homogeneous in £ 0 , £ v 


fMo x <» •••>£o a 4-i>£i) = 0 (* = !» (2) 

The coefficient of gj? £{ in fi(£ 0 x' 0 , ..., £^x' n _ x , £ x ) is a form in x ' 0 , ..., x' n _ t 
of degree j. If we construct the resultant system of the equations 
(2), using the result of the preceding section, we obtain a set of 
forms in x' 0 , ...,x' n _ v 

d { (x o, ...,a4_i) (* = 


whose coefficients are in the ring of coefficients of (1), with the 
following properties: 

(i) d' i £ , Q l = 2 a#(£ 0 > £i)fj{£o x o> •••> £o x n~i> £i) 

i 

for a suitable integer p { , where the a^(£ 0 , Ii) are forms in £ 0 , g x 
with coefficients in the ring which contains x' 0 , ...,x' n _ v K, and the 
coefficients of the equations (1); 

(ii) if for the specialisations (x 0 , x n _f) of (x' 0 , ^-i)> 

di(x o, = 0 (i = l,...,N), 


the specialised equations (2) have a solution, and conversely. This 
result remains true for any specialisation of the coefficients of (1), 
the coefficients of the forms d^x^, ...,x' n _f) being specialised to 
correspond to the specialisation of (1). 

Now, if the equations (2) have the solutions (c, d) for the special- 
isations (x 0 , of (x' 0 , evidently (cx 0 , ...,cx n _ v d) is 

a solution of the set (1). Conversely, suppose that the set (1) has 
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the solution (a; 0> ...,x n ). If x 0 , ...,x n _ 1 are not all zero, the set (2) 
has the solution (l,x n ) for the specialisation (x 0 , of 

(x' 0 , ...,a4_i). If, on the other hand, x 0 — ... — x n _ x = 0, « n =4=0, the 
equations (2) have the solution (0, x n ) for any specialisation of 
x' 0 , ...,x' n _ v Hence, in this case, 

di(xo, ...,x;_i) = 0 (i=l,...,N). 


We see then that a necessary and sufficient condition that the 
equations (1) have a solution is that the equations 

d{(x o, ■ ••,x n _ 1 ) = 0 (i — 1, ...,N) 

have a solution. Since these are homogeneous equations, and 
x i = £ 0 x'i, this is simply the condition that the equations 

x n- i) = 0 (* = 1, (3) 


have a solution. We see that if the equations (3) have a solution 
which is algebraic over the field of coefficients, so has the system (1). 

As a final preparatory step towards our first theorem, consider 
the reverse transformation connecting x t with x’ it 


A = xdo 1 

£l= x n 


1 


(i = 0, ...,n-l). 


and its effect on the identity 

= 0> £l)/,(£ 0 *0> •••> So^n-l) Si)* 

i 

This becomes 

*„), 

where 6^(a; 0 , ...,x n ) is a form in x 0 ,...,x n ; the coefficients are 
polynomials in £ 0 . On multiplying by a suitable power of £ 0 we 
obtain the equation 

A{x o, •••,«n-l)£™=££o '%(*<>> ■■■> x n)M x o. — 

and on equating like powers of £ 0 we have, finally, 


di(x o, • • . , *„_i) = S c i} (z 0 , ..., x n )fj(x 0 , ...,x n ). 

3 


( 4 ) 


We are now in a position to prove 
Theorem I. Let 

fi( x o >•••>«») = 0 (* = l,...,r) (1) 

be a set of homogeneous equations with indeterminate coefficients, 
and let fi(x 0 , ... ,x n ) = 0 {i = 1, ...,r) (5) 
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be the equations obtained from (1) by a given specialisation of the 
coefficients in (1). Then there exists a finite set of polynomials in the 
coefficients of ( 1), d v ...,d k , with the properties 

r 

(i) d t x g* = S • » x n)fMo> •••>*„) 

j -0 

for some value of m, the coefficients of a y (a; 0 , •••> x n ) being in the ring 
of coefficients of ( 1 ) ; and 

(ii) a necessary and sufficient condition that the equations (5) have 
a solution in an algebraic extension of the ring of coefficients is that the 
specialisations of the polynomials <J f arising from the given specialisa- 
tions of the coefficients should be zero. 

We have proved this theorem when n — 1 [§ 5] and we now proceed 
by induction. Beginning with the set of equations (1), we construct 
the set of forms dfx^, ...,x n _fj which satisfy (4) and have the 
property that the equations (3) have a solution if and only if the 
equations (1) have a solution. This property remains true for any 
specialisation of the coefficients. 

Now consider a set of N homogeneous equations 

Df(x 0 , x n _f) = 0 (i = l,...,N) (6) 

with indeterminate coefficients, D^x^ ...,x n _f) being of the same 
degree as dfx^, ...,x n _f). Applying the hypothesis of induction to 
(6), we construct the set E t (i = 1, of polynomials in the 

coefficients which are such that 

Ei x o l = H l A i j( x (i = 1, 

and which also have the property that a necessary and sufficient 
condition that any specialisation of (6) should have a solution is 
that the corresponding specialisations of the E t should vanish. 

We can specialise the coefficients of (6) in two stages. First we 
specialise the coefficients of (6) so that Dfx^, ...,x n _f) becomes 
d^XQ, x n _f), and then we specialise the coefficients of d t ( x 0 , . . . x n _f) 
by the specialisation which leads from (1) to (6). The first special- 
isation induces a specialisation 

Ei~>d { (i=l,...,k) 

of E t , and we have 

= £<%(a; 0 , . . . , x n _f) dj(x 0 , x n _f) 

= I l I,a ij c jk f k (x 0 ,...,x n ) (7) 

by (4), where is a form in x Q , ...,x n , with coefficients in the 

ring of coefficients of (1). 


HP I 


II 
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When we make the further specialisation which leads from (1) 
to (6) we induce a further specialisation of the d t . The vanishing of 
these is a necessary and sufficient condition for the existence of a 
solution of the specialised forms of (3) in an algebraic extension of 
the field of coefficients; that is, in an algebraic extension of the field 
of coefficients of (6). This, in turn, is a necessary and sufficient 
condition for the existence of a solution of (5) in an algebraic 
extension of the field of coefficients. Our result is therefore proved. 

Moreover, if the set (6) has any solution, it follows from (7) that 
the d t are all zero, and hence that the set (5) has a solution in an 
algebraic extension of the field of coefficients of (6). 


7. Non-homogeneous equations in several unknowns. We 

now consider how we may apply a process similar to that of § 6 to 
test for the existence of a solution of a set of non-homogeneous 
equations 


9i(*v —.*») = 0 (* = 1 . —,*■), 


( 1 ) 


having coefficients in the ground field K . It will be recalled that the 
condition given in § 6 for the case n = 1 was necessary, but was only 
sufficient if we imposed the restriction that the actual degrees of 
the equations were equal to the assigned degrees. We make a similar 
restriction in the present case, but even when this is made we may 
not be able to proceed in the obvious manner. For, let the poly- 
nomial gi(x v -..,x n ) be of degree m i , and let it be of degree when 
regarded as a polynomial in x n with coefficients in K\x lt ...,a5 n _j]. 
Write 

0<(*1. —.*»)= «<(*!. • • • . * n -l) x n‘ + b i( a v *n-l) X n^ + • • • • (2) 


a^Xy, ...,x n _j) is a polynomial in K\x ly ...,a; n _ 1 ] of degree m* — m\. 
Our restriction implies that it is not zero. When we apply the results 
of §6 to (1), regarded as equations in x n , we obtain a set of poly- 
nomials h( \ li-1 8 1 


having the properties 

r 

(i) hi{x lt ..., x n _j) s 2 a ij (x 1 ,...,x n )g j (a 1 , ...,x n ) (i = 1,. 

i “1 

(ii) necessary conditions that the equations 

9i( x l>--: x n-l> x n) = ° (» = 1 > •••>*■) 

(where x i is a specialisation of x { ) have a solution are 
hi{ x \y •••>x n _ i) =* 0 (i — l,...,s); 


( 3 ) 
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(iii) if h t (x 1 , = 0 (i = 1 , ...,3), 

then, either the equations (3) have a solution, or else 

»n-i) = 0 (* = 1, •••, r). 

In order to remove this last possibility we proceed as follows. 
Let u v ...,u n _ 1 be n— 1 independent indeterminates, and write 


Vi — x i~ u i x n (* = 
Vn = «n- 


Then we have ftfe, •••,£«) = ...,*/»), 

where . . . , y n ) is a polynomial with coefficients in K[u lt « n _ 1 ] 

of degree m i in y v ...,y n . In this polynomial y" 1 * appears with 
coefficient g*(u v ...,u n _ v 1), where g*(x v ...,x n ) is the set of terms 
in g t (x Xi x n ) which are of degree m i . We apply the results of § 5 to 
the equation* ^ ( . _ t r)> (4) 


regarded as equations in y n . We then obtain polynomials 

H t (y y n -i) 


with coefficients in K[u u . . such that 


(i) H i (y 1 ,...,y n _ 1 )=z'ZA ii (y 1 ,...,y n )G j (y 1 ,...,y n ) (* = 1, 

3 =1 

(ii) necessary and sufficient conditions for the existence of a 
solution of the equations 

fVn-i>yn) = 0 (* = L -,r), 
are ...,y n -i) = 0 (i = 1, 

In this case the possibility (iii) does not arise since 

• • •> Vn— l) ~ 9i (^1> • • •> ^n—V 1)» 

which is a non-zero element of K[u 1} ...,u n _ J. 

There exist [III, §8, Th. I] specialisations of 

%, such that 

gf(u v ...,u n _ v 1)4=0 (i = 1 ,...,/■), 

and the results just stated hold when u lf are specialised to 

u v ...,u n _ v In what follows we assume that these specialisations 
have been made. 


11*2 
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Any solution ( x' v ...,x' n ) of the equations (1) leads to a solution 
(x^ — u^x' n , ...,x' n ) of (4) and hence to a solution 

K •••. K-i - K) 

Of H i (y 1 ,...,y n _ 1 ) = 0 (i = 1, (6) 

Conversely, any solution {y[, ■■■,y' n _ 1 ) of (6) implies the existence 
of a solution y' n of the equations (4) obtained by specialising 
yi> •••> Vn—\ to y'v-,y'n-i, and the solution (y' v ... ,y ' n ) of (4) implies 
the solution (yi + % y' n , ■ ■ ■ , y'n-i + K-i y'n, y'n) of ( 1 ) . 

Thus we have shown how to find a set of equations (4) in n — 1 
unknowns, the existence of a solution of which is a necessary and 
sufficient condition for the existence of a solution of the equation 
(1) in n unknowns. Repeating this process n times we can determine 
whether a given set of equations (1) has a solution. 

Theorem I. If the equations 

9i(xi, ...,*«) = 0 (i = 1, ...,r) (1) 

have no common solution, there exist polynomials Afx^, ...,x n ) such 
that r 

2 M* i» — »*«) 9i( x i. •••»*«) = !• 

i =*1 

We have already [§ 6, (6)] proved this result in the case n = 1, so 
we proceed by induction. To this end, we assume the truth of the 
resultfor equations in n — 1 unknowns, in particular for the equations 
(6) above. Since equations (1) have no solution, neither have (6) 
and therefore there exist polynomials B i (y 1 , ...,y„_ x ) such that 

1 = 2 Bfyy, . . . , y n _ x ) Hfy^ ..., y n _f). 

1 = 1 

Using the property found above of the polynomials ...,y»-i) 

we have 8 r 

1 = S Bfy^ . .., y n _j) S A i} (y ly ...,y n ) G,(y v ...,y n ) 

1-1 j=l 

r 

= s myv ■ • y n ) Qj(yv ■ ■ ■> yJ> 

j-1 

8 

where C. ){y v . . . , y n ) = 2 B { (y lf ..., y n _f) ...,y n ). 

i = 1 

Replacing y t by x t -u t x n (i = 1, ...,n- 1) and y n by x n we have 

r 

1 b S A<(* x , . . . , x n ) gt(x v ...,x n ), 

<-l 

. ..,x n ) = Cj(a; x — u x x n , . ..,x n ). 


where 
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This important result will be used in the next section. Apart from 
this application we shall not have occasion to consider other pro- 
perties of non-homogeneous equations, and we do not pursue the 
topic further. 

8. Hilbert’s zero-theorem. Let 

ft( x v = 0 (* = l,...,r) ( 1 ) 

be any set of non-homogeneous equations with coefficients in K, 
and let g(x v ■ ■■,x n ) be a non -zero polynomial which is such that 

g(£l,-,Zn) = 0 

for every solution . . . , of ( 1 ) which is algebraic over the ground 
field. We prove Hilbert’s zero-theorem: 

Theorem I. There exist polynomials a i (a; 1 , ...,x n ) and an integer 
s such that 

r 

\g{x v ...,»„)]•= 2 a<(*i> • ••,*„)• 

i= 1 

Let z be any new indeterminate. Consider the equations (1) 
together with the additional equation 

zg{x lt ...,x n )- 1 = 0 (2) 

as equations in K\x x , ...,x n ,z]. This new set of equations has no 
solution in any algebraic extension of K, since for any such solution 
the left-hand side of (2) is — 1. There can be no solution in any 
extension of K, since this would involve the vanishing of the 
resultant system, and hence lead to a solution in an algebraic 
extension of K. Hence [§ 7, Th. I] there exist polynomials 

A i( x v ...,x n ,z), B(x v ...,x n ,z) 
in K[x v . . ., x n , z] such that 

r 

1 = S A »(%> •••>«„, *)/<(* 1 > 

% “ 1 + B(x v ...,x n ,z) [zg(x v . . . , x n ) - 1]. 

This identity remains true when we replace z by \g(x v ...,a; n )] -1 . 

Hence r 

1 = 2 Afa, ...,x n , g~ l )fi{x x , ...,x n ), 

i = 1 

and multiplying both sides by a suitable power of g(x v ...,x n ) we 
have the required result: 

r 

[ 9 (x 1, 1. —.*«)/<(* i, —,*»)» 

i = 1 

the «<(*!, ...,x n ) lying in K[x v ...,x n ]. 


( 3 ) 
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From this we deduce immediately that if (1) is a system of homo- 
geneous equations, and g(x v ...,x n ) is a form of degree m which is 
satisfied by all non -trivial solutions of (1) in an algebraic extension 
of K, then a relation (3) holds, the a^x^ ...,x n ) being homogeneous 
and of degree ms—m { , where m i is the degree offfx^ ..., x n ). 

9. The resultant ideal. The procedure we followed in obtaining 
the resultant system of a set of homogeneous equations was not 
symmetric in the indeterminates x 0 , ...,x n . We eliminated these in 
a definite order, and a different order of elimination may well lead 
to a different resultant system. Also, the number of forms which 
appear in the resultant system has not been indicated. We show 
here that a single resultant suffices for n+ 1 forms in n + 1 variables, 
whereas for any number of forms less than n 4- 1 no condition for 
solubility in a suitable extension of the ground field is necessary. 
The method is based on the consideration of resultant-forms. 

Let the set of homogeneous equations considered be 

/*(*<» —>* n ) = 0 (*- 1 , 

where ffx Q , =a t x^+ ... +«,*£* (i=l,...,r), 

all coefficients being indeterminates. In §6 we have already en- 
countered polynomials d v ...,d N in the coefficients of the forms f { 
with the properties expressed by the identities 

r 

dtX T k = £ Ajixo, ..., xjfoixo, ...,x n ) 

for a suitable r and a suitable k, the Aj being polynomials in x 0 , . . . , x n 
with coefficients in the ring of coefficients. We use this property to 
define resultant-forms. 

Any polynomial T(a v . . ., a r , . . . , <o lf . . . , o) r ) in K[a v . . . , w r ] for 
which there is a relation 

r 

Tx T k a S A j( x 0 - • • • . *»)/*(* 0 . • • • » * n) (!) 

for a suitable r and a suitable k, the A } being polynomials in 
x 0 , ...,x n with coefficients in K[a v ...,&» r ], is called a resultant-form. 
The polynomials d v ...,d N of §6 are all resultant-forms. We may 
give another definition of resultant-forms. Write 

fi = a t X S*<+ ... +(0 i X%<=f$+(0 i X%i. 
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Then if we substitute f*i m- 

in (1), the/f all vanish, and therefore 


x T k T^a ly ...,a r , gMi > •••> — 


( 2 ) 


( 3 ) 


Conversely, if for any polynomial T(a v ...,a r , ...,o) v ...,o) r ) the 
relation (3) holds, we have 


T(a v ...,a r ,...,o) v ...,(o r ) 

“ i ’ - & + " i+ & " £ + "' + #) 

+ S (“* + ^ ■ ■■■■ - . 


on expanding the polynomial T(a v . . . , a r , ...,(o v ..., o) r ) in ascending 
powers of w i + (/*/«”*)• Multiplying this last equation by a suitable 
power of x n , and using (3), we find 


K T(a v . . . , co r ) = S B^, . . . , x n )fi(x 0 , ...,x n ). 
i 

Hence T is a resultant-form. From (1) we can deduce (3), and from 
(3) arrive back at (1), but with x n instead of x k . Since the suffix n 
plays no special part in this process, we see that from (3) we may 
deduce (1), k being any one of the numbers 0, 1, 

The resultant-forms can be used as a resultant system for the set 
of homogeneous equations 

U x o> •••>*„) = 0 (i = 1, ...,r). (4) 


For if, after specialisation of the coefficients, there is a non-trivial 
solution, and we substitute this in (1), the right-hand side vanishes, 
and since not all x i are zero, T = 0,T being any resultant-form. On 
the other hand, if all resultant-forms vanish for a given specialisation 
of the coefficients of (4), those resultant-forms d v ...,d N which belong 
to the resultant system of (4) also vanish, and the set of equations 
has a non-trivial solution. As we shall see immediately, the resultant- 
forms constitute an ideal in the ring of polynomials K[a v 
If we can find a basis for this ideal [cf. § 2], this same basis will serve 
as a resultant system for the equations (4). 
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If S(a v ...,(o r ) and T(a v ...,(o r ) are two resultant-forms, and 
R(a v ...,o) r ) is any element of K[a v ...,<y r ], then since 


8 



•••) 




— 0 , 


and 



it follows that 



= 0 , 


8(a x , ...,(o r )- T(a x , ...,w r ) and R(a v ...,(o r ) S(a v ...,(o r ) 

are resultant-forms. Hence resultant-forms constitute an ideal in 
K\a x , ...,<y r ]. We call this the resultant ideal. 

If the product of two elements of K\a v . . . , w r ] lies in this ideal, 
one of the elements must lie in the ideal. For if R{a v ...,(o r ), 
U(a v ...,<o r ) are the elements, we have 



One of the factors on the left must vanish, and therefore one of the 
polynomials must be a resultant-form. An ideal with this property 
is called a prime ideal. We now prove 

Theorem I. If the number r of forms f { is less than n+ 1, there 
are no resultant-forms different from zero. 

If T is a non-zero resultant-form, 



T^a v ...,a r , ..., - 

ft 

1 

*41 1^* 

III 

O 

(6) 

but 

T{a v ...,a r ,. 


•,W,)# 0. 

(6) 


We show that a relation of this type continues to hold even when 
the coefficients a v ...,a r , ... are no longer indeterminates, but 
specialisations a v ...,a r , ... in K. Consider, first of all, the special- 
isation a 1 -»a 1 . It may happen that 


T(a v ...,a r , ...,«!, ...,w r ) = 0. 

If this is so, 

T(a v ...,a r ,...,o) 1 ,...,(o r ) = (a 1 -d 1 ) 8 T*{a 1 ,...,o) r ) 

for some integer s, where 

T*{a v ..., a r , ...,w r )#0. 



9. THE RESULTANT IDEAL 


169 


Since 
it follows that 


T*{a v ...,a r , ..., ~~m^j =0» 


and so T*(a v ...,a r , (o v ...,(o r ) is a non-zero resultant-form. Special- 
ising each of the indeterminates present in turn, we find at each stage 
a non-zero polynomial which vanishes when the substitution (2) is 
made; that is, a resultant-form. Hence, finally, we obtain a non-zero 
resultant-form when 


provided that we began with a non-zero resultant-form before any 
indeterminates were specialised. 

Now, since r <n + 1, we may specialise a v ...,a r ... so that 
where 


fi = xfii + (oft™* (» = 1 ,...,r). 


The existence of a non-zero resultant-form before the specialisation 
involves the existence of a non -zero polynomial 


T{(o v ...,(o r ) 

such that _,/ _ n 

r^-’ cr * 

But x 0 , ...,x r _ v x n are independent indeterminates, and we are 
therefore led to a contradiction. This proves the theorem. 

It follows that the resultant system of r(r <n + 1) homogeneous 
equations in »+l unknowns vanishes identically, and therefore 
that the equations always have a solution in some algebraic exten- 
sion of the ground field. 

Theorem II. There is a non-zero resultant-form for n+\ equations 
in n + 1 unknowns. 

There must be one, at least. For if not, n+ 1 homogeneous 
equations always have a solution, for any specialisation of the 
coefficients. But there is an evident specialisation for which the 
equations become 

x?* = 0 (i = 0, ...,n), 


and these equations have no non-trivial solution. Our assumption 
is therefore false, and there is at least one non-zero resultant-form! 
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The argument of the previous theorem shows us that when 
r = n + 1 there can be no non-zero resultant-form which is indepen- 
dent of (i) n+ i . Assumption of the contrary hypothesis leads to an 
algebraic equation connecting x 0 , ...,x n . 

Hence if we choose, from the ideal of all resultant-forms in 
Afeq, ...,<t> n+1 ], a polynomial whose degree in (o n+1 is the least 
possible, this degree is not zero. If this polynomial is reducible, one 
of its irreducible factors must belong to the ideal, which is prime, 
and this factor must also be of the minimum possible degree in 
(o n+1 . We call this resultant -form R, and prove 

Theorem III. Every resultant-form is divisible by R. 

Let R = A<vi +1 +..., 

when arranged in descending powers of 6> n+1 , and let 




be any member of the ideal, similarly arranged. Since ju, > A, 

A8-Bo)^~ + \R=T 


is another member of the ideal, with a lower degree in w n+1 than 8. 
Repeating this process, using T instead of 8 for the next stage, we 
finally arrive at a polynomial 


T*=A k S — QR 


in the ideal which is of lower degree than R. This must be a zero 
polynomial, and therefore 

A k 8 = QR, 


showing that A k 8 is divisible by R. Since R is irreducible and 
cannot divide A, which is independent of o) n+1 , R divides S. This 
proves the theorem. 

This resultant-form R is therefore a basis for the ideal of resultant- 
forms in K[a v ...,&> n+1 ]. A necessary and sufficient condition for 
the n + 1 homogeneous equations 

fi(x 0 ,...,x n ) = 0 (i = 1 , ...,7i+l) 

to have a non-trivial solution is given by the vanishing of R. We 
call R the resultant of the system. 
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10. The {/-resultant of a system of equations. Suppose that 
we know that a system of homogeneous equations over K 

/<(*»> ~ 0 {i = 1) • ••>^’) (1) 

has a finite number of solutions. Let these be 


& a u = i *). 


in some extension of K. 

To the equations (1) we add the equation with indeterminate 
coefficients 


/r+1 = U 0 X 0 +-"+Vn = 0. 


( 2 ) 


Since the u i are indeterminates, 

i = 0 

and therefore the resultant-forms of equations (1), (2), of which (1) 
are specialised, do not all vanish. These resultant-forms are poly- 
nomials in u 0 , ...,u n . If one such is T(u), 

r+1 

T{u ) = 2 Ai(x 0 , ...,x n ,u) fi(x 0 , . ..,x n ). (3) 

i = l 

Now the f i? if they contain u 0 , ...,u n , are homogeneous in these 
indeterminates. If we equate sums of terms of equal degree in the 
u’s on each side of (3), we obtain equations similar to (3), but the 
T{u) are now homogeneous in u 0 ,...,u n . These T{u) are still 
resultant-forms, satisfying equations of type (3), and we therefore 
need only consider resultant-forms which are homogeneous in 
u 0 , ..., u n . 

Let b t {u) (4) 


be a basis for the resultant-forms of equations (1), (2). We may 
assume that each b^u) is homogeneous in the u’s, and is also of 
degree greater than zero, since all resultant-forms of the set (1) 
vanish. The set of forms (4) serves as a resultant system for the r+1 
equations given by (1) and (2). 

This resultant system vanishes for the specialisations 


Ui^-Ui (i = 0, 


if and only if a solution of the equations (1) satisfies (2); that is, if 

“ d only lf o, 


for some value of j. In other words, the common zeros of the 
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forms (4), considered as forms in the set u 0 , are precisely the 

zeros of the product 

<*(*)= n/w= n ... +u n &>) 

3-1 1 

in a suitable extension ring of K[u\. 

It follows by Hilbert’s zero-theorem that on the one hand 

[b^Y^A^Oiu) (i = l, ...,<)» (5) 

and on the other that t 

[£(«)]' = z«i(«) &<(*). (6) 

t = 1 

Now, the/y) are linear forms in u 0 , ...,u n , and therefore irreducible. 
From (5) the bfu), and therefore their highest common factor D(u), 
can be divided by the linear factors /y). But (6) can also be written 

[G(u)y = B(u)D(u), 

from which it follows that D(u) contains no other linear factors but 
these /y>. We deduce therefore that 

D(u) = n (u 0 gjf> + • • • + §»V* K>1)- 

i-i 

Since D(u) is the highest common factor of the forms (4) which 
are a basis for the resultant-forms of equations (1), (2), D(u) divides 
all resultant-forms, and, in particular, D(u) divides every member 
of any resultant system of the r+l equations (1) and (2). We 
have proved 

Theorem I. The linear forma /y> which determine the solutions 
of equations (1) are found by the resolution into factors, in a suitable 
extension field, of the form D(u). This form D(u) is the highest common 
factor of the resultant-forms off v ...,f r , f r+1 , and is called the u-resultant 
offv—fr- 

If the set of equations (1) has an infinite number of solutions, 
we may extend the notion of the m - resultant. We consider the 
equations (1) together with the set of linear equations with indeter- 
minate coefficients 

n 

fr+i=h u i] x j = 0 (*= 1 , ( 7 ) 

j =0 

If s is large enough, the sets of equations ( 1 ) and ( 7 ) have no common 
solutions. For if we take s = n+ 1, the set (7) taken by itself has 
no solution. 
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Choose s to be the smallest integer such that (1) and (7) have no 
common solution. Consider the resultant-forms of these equations. 
Generalising our previous argument, we see that the only resultant- 
forms we need consider are those homogeneous in each set of 
indeterminates 

~ ( u i0> ' • •> ^ in ) (* = 1 » •••»£)• 

If for the specialisations 

U 8j^U s j (j~ 0 , —,») 

the equations (1) and (7) have a solution (£ 0 , ...,£ n ), all resultant- 
forms T(u v ...,u s ) vanish for this specialisation. That is, 

T(u v ...,u s _ v u 8 ) = 0. 

Reasoning as above, we see that 

n 

2 

i-0 

is a factor of T(u v ...,u s ). If D(u v ...,u s ) is the highest common 
factor of all resultant-forms, the factor (7) divides D(u lf 

But since this highest common factor, regarded as a form in 
u s0 , •••,u sn has only a finite number of linear factors in any exten- 
sion field, the equations 

fi = 0 (* = 1 r+8-1) ( 8 ) 

have only a finite number of solutions. 

The M 8 -resultant of (8) is D(u v ...,u s ). It may also be called the 
w-resultant of equations (1). In a case which will be considered in 
vol. H, in which the equations (1) define an irreducible variety, this 
generalised w-resultant is of considerable importance. 
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BOOK II 

PROJECTIVE SPACE 


CHAPTER V 

PROJECTIVE SPACE: ALGEBRAIC 
DEFINITION 

1. Introduction. It has been customary to follow two distinct 
and alternative lines of approach in defining a projective space of 
n dimensions. One is purely algebraic, and the other is based on the 
so-called ‘propositions of incidence’. We propose to consider both 
methods of approach, devoting this chapter to the algebraic 
approach, and the next to the synthetic method. One of our main 
objects will be to compare the two methods in considerable detail. 
For this reason we shall begin by defining a more general space than 
that with which we shall ultimately be concerned. We shall impose 
new conditions only when we have examined their significance in 
some detail. 

Our first action in defining a projective space algebraically is to 
select the ground field K. Initially we shall impose no restrictions 
on K. It may be non-commutative, and of finite characteristic. 
When K is non-commutative we shall be led to consider another 
field K, which bears to K a relation similar to isomorphism, but 
differing in one respect. To define this relation, which we call skew- 
isomorphism, in general terms, let 8 denote a set of elements in 
one-to-one correspondence with the elements of K , a and a denoting 
corresponding elements of K and S respectively. We define addition 
and multiplication in the set by the rules 

o + 5 = a + 6, ab = (6a). 

It is easily proved that the elements of S, with these laws of com- 
position, form a field, which we denote by K. This field K is iso- 
morphic with K, the reversal of the order of the factors in the product 
being, simply, a change of notation for the product of two elements 
o and 5 in K. But we shall find it convenient to distinguish between 
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an isomorphism between two fields K and K in which ab corresponds 
to ab, and one in which ab corresponds to ba. We shall refer to the 
first case simply as isomorphism, and to the second as skew-iso- 
morphism.* In the case of a commutative field, of course, there is 
no difference between isomorphism and skew-isomorphism. 


2. Projective number space. A set (<x 0 , a n ) of n + 1 elements 

of the ground field K will be called an (n + l)-tuple if not all the 

elements a 0 , ...,a n are zero. Two (n + 1 )-tuples (a 0 , . . . , a n ), (/? 0 , . . . , fi n ) 

are said to be right-hand equivalent if there exists an element A of 

K such that />■»/• « x 

«< = &A (* = 0 ,...,n), 

and left-hand equivalent if there exists an element p of K such that 

= pfii ( i = 0,...,n ). 


Clearly, when the two (n + l)-tuples are right-hand equivalent, A 
cannot be zero, since, otherwise, each a. i would be zero. Similarly, 
if they are left-hand equivalent, p cannot be zero. Right-hand 
(left-hand) equivalence is clearly reflexive, symmetric and transitive. 

A set of right-hand equivalent ( n+ l)-tuples is called a point of 
right-hand projective number space of dimension n over K. The 
aggregate of such points is called a projective number space of dimen- 
sion n over K. This is denoted by PN r n (K). If left-hand equivalence 
is used in place of right-hand equivalence we obtain a left-hand 
projective number space of dimension n over K, denoted by PN l n (K). 
If K is commutative, pN‘ n (K) = PN r n {K). 

If K is not commutative, and K is skew-isomorphic to K, two 
(n + l)-tuples (a 0 ,...,a n ), {ft 0 , ...,/? n ) over K are right-hand (left- 
hand) equivalent if and only if the corresponding (n + 1 )-tuples 
(a 0 , ...,a n ), (/? 0 , ...,/? n ) are left-hand (right-hand) equivalent. 
Hence, in an obvious sense, PN l n (K) and PN r n (K) are similar spaces, 
and PN l n (K) and PN r n (K) are also similar. For this reason it will 
be sufficient to develop the properties of PN r n (K), those of PN l n (K) 
being read off from those of PN r n (K). 

Let A = (a^) be any regular (n+ 1) x (n+ 1) matrix whose 
elements are in K, and consider the equations 

n 

Vi = (i = 0,...,n), (1) 

i- o 


* van der Waerden uses the term inverse -isomorphism. 
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and also the equivalent equations 

= YtPaVi (i = 0,...,rc), (2) 

j=0 


where B = is the inverse of A. If (x 0 X, :..,x n X) is a set of right- 
hand equivalent (»+ l)-tuples, the equations (1) transform these 
(n + 1 ) -tuples into the set (y 0 A, ...,y n A) of equivalent (n + 1 ) -tuples ; 
that is, the equations (1) transform a point of right-hand projective 
number space into a point of right-hand projective number space. 
By virtue of (2) this correspondence is one-to-one. 

A set of equations (1), which we shall usually write in matrix 


form as 


V = Ax, 


( 3 ) 


determines a projective transformation of PN r n {K) into itself. 

To each regular (n+ 1) x (n+ 1) matrix A over K there corre- 
sponds a projective transformation ( 1 ) . Let A x and A 2 be two regular 
matrices, defining the transformations 

y = a x x, (i x ) 

y = a 2*. (i 2 ) 

We operate on a point x with (l x ), and follow with (1 2 ). The point x 
becomes y = A x x, and y is transformed into 2 = A 2 y = A 2 A 1 x; 
and so the relation between x and z is 


2 = A 2 A x x. 

Since [II, §3] A 2 A X is regular, the product of the two projective 
transformations of PN r n (K) given by (l x ) and (1 2 ) is a projective 
transformation of PN r n (K), and hence we have 

Theorem I. The projective transformations of PN r n (K) into 
itself form a group. 

Projective transformations of PN l n (K) can be defined in a similar 
way. It should be noted that the equation of the transformation 
corresponding to the matrix A is now 

y = xA, (4) 

where x and y are now 1 x (h + 1) matrices. 

Let K* be an extension of K. The points of PN r n (K) are a subset 
of the points of PN r n (K*). The projective transformations of 
PN r n (K*) correspond to the regular {n+ 1) x {n+ 1) matrices over 


HP I 


12 
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K*, and a subset of these are matrices over K. The projective trans- 
formations corresponding to the matrices of this subset form a sub- 
group of the group of projective transformations of PN r n (K*), 
and under the transformations of this subgroup the points of 
PN r n (K) are transformed into points of PN r n (K). We say that 
PN r n (K*) is the extension of PN r n (K) corresponding to the extension 
K* of K. Similarly, the group of projective transformations of 
PN r n (K*) is an extension of the group of projective transformations 
of PN r n (K). 

3. Projective space of n dimensions. Let 8 be a set of elements 
which can be put in one-to-one correspondence with the points of 
PN r n (K), and let T denote a correspondence such that if A is any 
element of S, and B any point of PN r n (K), there corresponds to A 
a unique point T{A) of PN r n (K), and to B a unique element T~ 1 (B) 
of S. Then to each element A of S there corresponds a set of 
equivalent (»+ 1) -tuples (a 0 A, ..., a„ A), where T(A) is (a 0 , ...,a n ). 
Any (n + 1) -tuple of this set is called a set of coordinates of A. 

Now consider a projective transformation in PN r n {K). If this takes 
the point T(A) of PN r n (K), which corresponds to the element A of 
S, into T'(A), the correspondence between the elements A of 8 and 
the points T'(A) of PN r n (K) is one-to-one, and thus a new coordinate 
system is defined in S. Indeed, a coordinate system is defined for 
each projective transformation of PN r n (K). The various coordinate 
systems obtained in this way will be called allowable coordinate 
systems in S. From the group properties of projective transforma- 
tions, discussed in §2, it is clear that the same set of allowable 
coordinate systems in S is defined if the initial correspondence T 
between S and PN r n (K) is replaced by any other correspondence T' 
which gives an allowable coordinate system. 

The set 8, with the set of allowable coordinate systems, is called 
a right-hand projective space of n dimensions over K, and the elements 
of 8 are called the points of this space. The space may be denoted by 
P!^(K), but various other notations will be used. In the case in which 
K is commutative, and there is no doubt about the ground field, the 
symbols [n], 8 n , S' n will be used to denote a projective space of n 
dimensions, the notation S n being particularly useful when a number 
of projective spaces have to be considered simultaneously. 

If ( x 0 , x n ) is a set of coordinates of a point P in in a 

given allowable coordinate system, the coordinates {y 0 ,--.,y n ) of 
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this point in another allowable coordinate system are given by the 
equations n 

yi = 'L<XijX} (i = 0 n), 

j =o 

where A = (a i} ) is a regular (n+ 1) x (n+ 1) matrix over K. The 
equations for transformation of coordinates in P r n {K) can therefore 
be written, in matrix notation, as 

y = Ax, 

and there is one such transformation for each regular (n + 1 ) x (n + 1 ) 
matrix over K. 

We consider how P^K) can be extended when the field K is 
replaced by an extension K*. In § 2 we saw how to extend PN r n (K). 
We now embed the set S in a set S* which can be put in one-to-one 
correspondence with the points of PN r n (K*) in such a way that the 
elements of S* which lie in S correspond to points of PN r n (K*) 
which are in PN r n (K). This is clearly possible; a set such as S* can 
be formed by the elements of S and the points of PN r n (K*) which 
are not in PN r n (K). Using the given correspondence between S* 
and PN r n (K*), and the projective transformations of PN r n (K*), we 
construct a projective space of n dimensions P r n {K*) whose points 
are the elements of S* in which P^iK) is embedded. Amongst the 
allowable coordinate systems of P T n {K*) there is a subset which 
induces the allowable coordinate systems of P r n (K). 

In later chapters we often have to replace a field K by an exten- 
sion K*. When this is done we shall assume that P^K) has been 
extended to P'^K*), as above. The effect of this is to add new points 
to P r n (K), and to exteqd the group of allowable transformations. 

Let us suppose that we have a coordinate system in P r n (K*) in 
which the points of P^(A) can be given coordinates in K. Consider 
the coordinates (aj, .. ., a*) of a point R in P r n {K*). 

(i) If, for some A* in K*, 

a* A* (i = 0, ...,») 

is in K, the point R is in P r n {K), and is sometimes called a rational 
point of P r n (K). 

(ii) If, for some value of A*, each a* A* is algebraic over K, R is 
called an algebraic point of P'^K). 

(iii) If, however, for all A* in K*, at least one af A* is transcen- 
dental over K, R is said to be a transcendental point of P r n {K). 


12-2 
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This completes our description of a right-hand projective space 
P r n {K). The method of construction of a left-hand projective 
space P l n (K) is now evident, and the reader can immediately read 
off the relations between P l n (K) and P r n (K), where K is skew- 
isomorphic to K. 


4. Linear dependence in P'XK). Consider k+1 points 

AO Ak 

XI j • . . y Xl 


in P^(K), and let 


be their coordinates in an allowable coordinate system. We say that 
the points A 0 , ...,A k are linearly dependent in P^iK) if there exist 
elements A 0 , . . . , X k of K, not all zero, such that 

k 

2a*A, = 0 (i = 0, (1) 

j=0 

In order that linear dependence may be considered as a property of 
the points A 0 , ...,A k , it must be shown to be independent of the 
coordinates selected for A 0 , ..., A k in the chosen coordinate system, 
and also to be independent of the allowable coordinate system. 

To prove the first result, let (/?j, ...,/?£) be other coordinates for 
A i in the given allowable coordinate system. Then 

Pi = a W 

where /i t is some non-zero element of K. From (1) we deduce that 

= o (* = <>,...,»). 

3 = o 

Hence, if a relation of type ( 1 ) holds for one choice of the coordinates 
in a given allowable coordinate system, a relation of this type also 
holds for any other choice in the given coordinate system. 

Now consider any allowable transformation of coordinates 

giTen by y = Ox, (2) 

where C = (y { j) is a regular (n+ 1) x (n+ 1) matrix over K. In the 
new coordinate system the coordinates of A i can be taken as 
where » 

A 1 = 2 y jp cA p . 

P “0 

Then S fi* t = 22 Jip^pK = 0. 

<-0 i-0 p- 0 

from (1). Hence, again, if a condition such as (1) holds in one 
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allowable coordinate system, a similar condition holds in any other 
allowable coordinate system. Thus the linear dependence of points 
in P^K) is a property of the points, and does not depend on the 
allowable coordinate system. 

Recalling the properties of the linear dependence of vectors 
derived in Chapter II, we have the following alternative statements 
of the conditions for the linear dependence of the points A 0 , . ..,A k . 

(i) If (<Xo, ...,a*) (i = 0 , ...,k) are the coordinates of A 0 , ...,A k in 
a given allowable coordinate system, a necessary and sufficient condition 
that A 0 , ...,A k be linearly dependent is that the Jc + 1 right-hand vectors 

(ao. •••,<*») (i = 0,...,k) 

be linearly dependent. 

(ii) A necessary and sufficient condition that A 0 , ...,A k be linearly 
independent is that the matrix 



be of rank &+ 1. 

An immediate corollary is 

Theorem I. Any m points of P r n {K), where m>n+ 1, are linearly 
dependent. 

We can introduce a useful symbolic notation if we assume that 
the symbol A 1 for a point represents a particular (n + 1) -tuple in the 
class of equivalent ( n + l)-tuples which give the coordinates of A i 
in a given allowable system. The transformation (2) then defines 
a particular set of coordinates for each A i in the new coordinate 
system, and this set will also be denoted by A 1 . When we use the 
symbol A < for a point in this way, we extend our notation by multi- 
plying A i , on the right, by a non-zero element of K, so that if 
A 1 = (a 0 , ...,a„), then A* A = (a 0 A, ...,a n A), and the symbols A* 
and A*A represent the same point. The equation (1) can then be 
written as k 

I I A% = 0, 

i-0 

and this relation does not depend on the allowable coordinate system 
used. 
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Any point A such that k 

An = SA. 

i= 0 

for some non-zero n in K, is said to be linearly dependent on A 0 , ...,A k . 
If k = n, and in a given allowable system of coordinates we take 

-4’ = (ti i0 , $n, •••>£«>•••> ti in ) (i = 0, . . . , n) 

= (0, 0, 1, ..., 0), 

we obtain n + 1 linearly independent points. Since any point 

A = (a 0 ,...,a n ) 

can be written as n 

A = 2j A^Otf, 

{=0 

every point in P'JJK.) is linearly dependent on these n 4- 1 points. 
They are said to form the simplex of reference for the given coordinate 
system. 

The properties of the linear dependence of points can now be 
read off from the properties of the linear dependence of vectors 
[II, § 1]. It will not be necessary to give proofs in detail, since these 
can be deduced at once from the corresponding proofs for vectors. 
The main results are as follows. In the first, A 1 has the symbolic 
meaning just defined. 

Theorem II. If A 0 , ...,A k are linearly independent, and if 

k 

AX = SAU* 

i-0 

is linearly dependent on them, the (k + \ ) -tuple (A 0 , ..., X k ) is uniquely 
determined to within the class of equivalent (k+ 1 )-tuples. 

Theorem III. If B °, ..., B h are linearly dependent on A 0 , ..., A k , 
and P is linearly dependent on BP, then P is linearly dependent 

on A 0 , ...,A k . 

Theorem IV. In any finite set of points A 0 , ...,A k we can find a 
linearly independent subset A**, ..., A** such that any point linearly 
dependent on A 0 , ..., A k is linearly dependent on A { °, ..., A 

Theorem V. If B °, ..., B h are h + 1 linearly independent points, 
each of which is linearly dependent on A 0 , ...,A k , then h^k, and we 
can find k—h points A^+i, ...,A ik in the latter set such that every point 
linearly dependent on A 0 , ...,A k is linearly dependent on BP , ..., B h , 
A { *+i , ..., A^. (Exchange Theorem.) 
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If A 0 A k are &+ 1 linearly independent points of P r n (K), the 

aggregate of points of P r n (K) which are linearly dependent on them 
constitute a linear space of k dimensions. We shall call a linear space 
of 1 dimension a line, and a linear space of 2 dimensions a plane. 
From the first part of the Exchange Theorem we deduce that any 
&+1 points of a linear space of k dimensions, where h>k, are 
necessarily linearly dependent. On the other hand, there exists at 
least one set of k+ 1 independent points, -4°, ...,A k . If B°, ...,B k 
is any other set of k+ 1 linearly independent points of the space, 
the second part of the Exchange Theorem tells us that the linear 
space determined by B°, ..., B k is the same as that determined by 
A°,...,A k . 

Any set of k + 1 linearly independent points of a linear space of 
k dimensions is called a basis for that space. 

We use the above results to show that a linear space of k dimensions 
is a P r k {K). Let A 0 , ...,A k be any basis for the linear space. Then 
any point Q of the space is given by an equation 

q-£a%, 

i-0 

where the (k+ 1) -tuple (A 0 , ..., A fc ) is uniquely determined by the 
point Q, to within right-hand equivalence. This establishes a one-to- 
one correspondence between the points of the linear space and those 
of PN%(K). Now consider what happens if B °, ..., B k is any other 
basis for the linear space. We have 

Jc Jc 

B i = '£Ai/3 ji , and A* = 

0 j -=0 

k k 

Hence A* = £ 2 A h /3 hj a H . 

j - 0 ft -0 


But this must be an identity, since A 0 , ...,A k are linearly indepen- 
dent. The matrices „ , „ . . . . 

B = (&,-), A = (ct {j ) 

are therefore inverse matrices. It follows that A and B are both 
regular. 

k 

Now, if Q = £ AVI*, we also have 
o 

Q = X2Bict ji \ i = 'ZB%, 


Pi = Sa^A^, 


where 


(3) 
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and therefore the change of basis from A°,...,A k to 
results in the projective transformation (3) in PN r k (K). Conversely, 
any projective transformation in PN r k (K) corresponds to a change 
of basis in the linear space. Thus a linear space of k dimensions is a 
PUK), each allowable coordinate system corresponding to a choice 
of basis in the linear space. 

From Th. Ill we deduce at once that if S h is a linear space of h 
dimensions determined by a basis B °, ..., B h , and if each B i lies in 
a linear space S k , then every point of S h lies in S k . That is, S h lies 
in S k . We write this 0 „ 

&h— &k‘ 

It is evident that if S a ^S b , and S b £ S c , then S a £ S c . Also, if 
S a ^S b and S b c S a , then S a = S b . 

From Th. IV we deduce that any k + 1 points lie in an S h (h ^ k) 
and that h < k if and only if the points are linearly dependent. 

Again, let S p and S q be two linear spaces in P'^K), and suppose 
that P°, ..., P p is a basis for S p , and Q °, ..., Q 9 a basis for S q . If the 
p + q + 2 points P°, . .., P p , Q°, Q q , are linearly dependent, so that 

£p i *i+j:Q i ti { = o, 

0 0 

we deduce that S p and S q have a point in common. For neither 

P Q 

2 P i A i — 0, nor 2 = 0, since P°, ..., P p are linearly indepen- 

0 0 p 

dent, and so are Q °, ..., Q Q . Hence 2 { is a point in S p which, 

a o 

since it can be written as — 2 Q Vi. is linearly dependent on Q°, Q q , 
and therefore lies in S q . 0 

Finally, we note the trivial fact that if P and Q are distinct 
(that is, linearly independent) points of P^(A) the point which is 
represented symbolically by P + Q is necessarily distinct from both 
P and Q\ hence, since any S 1 contains at least two points, any 
contains at least three points, and we may sum up the results 
established in this section as follows: 

In P r n (K) there are linear spaces of dimensions 0,1,2 a 
space of 0 dimensions being a point. Between any two such spaces 
S h and S k there may exist a relation of one ‘lying in’ the other, say 
S h <=:S k . This relation satisfies the conditions 

I. If S h ^S k , and S k ^S h then S k = S h . 

II. If S v c S a and S 0 £ S r , then S v £ S r . 
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A set of p + 1 points is linearly dependent if there exists a linear 
space S q containing them, where q<p. 

III. Every 8 X contains at least three distinct points. 

IV. Given p + 1 linearly independent points there exists at least 
one 8 p containing them. 

V. In any S p there exists at least one set of p + 1 linearly in- 
dependent points. 

VI. If P °, . . . , P p are p + 1 linearly independent points which lie 
in an S q , any S p containing them lies in S q . 

VII. If S p and 8 q are any two linear spaces of dimension p and q, 
and P°,...,P p are p+ 1 independent points of 8 p and Q°, ..., Q 9 
are g+1 independent points of S q , then if the p + q + 2 points 
P°, P p , Q°, are linearly dependent there exists at least one 
point R which lies in S p and in 8 q . 

VIII. There exists at least one set of n + 1 linearly independent 
points, but any set of m points, where m > n + 1, is linearly dependent. 

We have, indeed, proved a great deal more, but it is desirable at 
this stage to emphasise that a right-hand projective space satisfies 
these propositions, which we call the Propositions of Incidence. In 
the next chapter we shall begin by taking an aggregate of undefined 
elements, which we shall call linear spaces of dimensions 0, 1, 2, .... 
We shall postulate the existence of a relation <= between certain of 
these spaces, and assume that this relation satisfies I and II above. 
The statement which follows II is then taken as a definition of linear 
dependence. We shall then impose the further conditions III- VIII, 
noting that it is only in VIII that the number n appears. It will be 
our main object in Chapter VI to show that these linear spaces of 
dimensions 0, 1,2, ... can necessarily be represented as the linear 
spaces of a P^(K), for a suitably chosen K. This will follow directly 
from the above postulates if » > 2. If % = 2 a further postulate will 
be necessary. The case n — 1 is without interest. 

For a left-hand space P l n (K) a theory of linear dependence, 
similar to that developed for P^iK), may be stated. If 

A i = (al ...,a*) (i = 0,...,k) 

are k+ 1 points, the points linearly dependent on them are the points 

...,£A f a* 

o \ o o 

and the theorems proved earlier in this section can be proved for 
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Pn(K), with the necessary formal alterations. In particular, we 
arrive at the same propositions of incidence, and we shall therefore 
be able to prove that, with the necessary reservations if n < 2, 
these propositions serve to define a left-hand projective space. This 
implies that a left-hand projective space may be regarded as a 
right-hand projective space, and conversely. 

The explanation of this fact will be found in our method of con- 
structing a projective space from the propositions of incidence. 
We shall have to construct a field, and we shall see that there are 
two ways of doing this, leading to two fields which are skew- 
isomorphic. Now, if K is skew-isomorphic to K, there is a one- 
to-one correspondence between the points of P^^K) and those of 
P l n (K), and a one-to-one correspondence between their allowable 
coordinate systems. Choosing corresponding allowable coordinate 
systems in P l n (K) and P^iK), the points 

(<4, ...,a£) and (5$,..., a*) 
correspond. The point 

(Z‘ A*a£, ...,rA*a£) 
of P l n (K) corresponds to the point 

of P'n^K). Hence, the points of P l n {K) which are linearly dependent 
on a given set of m points of P l n {K) correspond to the points of 
P^(X) linearly dependent on the m corresponding points. Indeed, 
corresponding to any property of spaces S a , S b , ... in P l n {K) which 
remains unchanged under any allowable projective transformation 
in P l n (K) there is a property of spaces S a , S b , ... in P r n {K) which 
remains unchanged under any allowable projective transformation 
in P n (K). We say that the two spaces are protectively identical. 

5. Equations of linear spaces. Duality. We consider a right- 
hand projective space P r n {K), and a left-hand projective space 
P l n (K). In each of these we fix, quite arbitrarily, an allowable 
coordinate system, but, having done this, we regard the two as 
associated. We now show how we can proceed to associate with each 
allowable coordinate system in P r n {K) a unique allowable coordinate 
system in P l n (K). 
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If the transformation from the selected coordinate system in 
P'niK) to the new coordinate system is 

y = Ax, 

the transformation from the selected coordinate system in P l n {K) 

to the coordinate system which we associate with the new system in 

P'ni.K) is taken to be , _ . A _ x 

y — x Ji 

(where x' and y' are, of course, 1 x (n + 1) matrices). 

It can be verified at once that this association of allowable co- 
ordinate systems in the two spaces is one-to-one, and is determined 
by the above rule when the selected coordinate systems are any pair 
of associated systems. 

Now consider a linear space S k in P r n (K). Let A°,...,A k be a 
basis for this space, where 

A i = (<4, ...,<) 

are the coordinates of A 1 in a given allowable system. Consider the 
equations n 

^Xjaj = 0 (i — 0, ...,&). (1) 

j =o 

Since the (n + 1) x (&+ 1) matrix (aj) is of rank &+ 1 [§4, (ii)], the 
solutions of the homogeneous equations (1) form a left-hand vector 
manifold of dimension n — k [II, §6]. Let the non-zero vectors of 
this manifold be taken as the coordinates of the points of a linear 
space in the associated coordinate system in P l n (K). Its dimension 
will be n — k- 1. We denote the space by Z n _ k _ v If 

«...,<) (i = l,...,n-k) 

is a basis for 2’ n _ fc _ 1 , the (n — k)x(n+ 1) matrix (uj) is of rank n — k, 
and from (1), 

71 

= 0 (* = 1, ...,n — k; h = 0 k). (2) 

j'-o 

But, since (uj) is of rank n — k, the equations 

= 0 (i = 1 n — k) (3) 

j -0 

are satisfied by vectors of a right-hand vector manifold of dimension 
k+l. The non-zero vectors of this manifold are the coordinates of 
points of a linear space of dimension k in P'niK). By (2) above, this 
space contains A°, ...,A k , and therefore coincides with S k . Thus 
S k determines 2’ n _ fc _ 1 , and conversely determines 8 k . 
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We now show that this association does not depend on the 
particular associated pair of coordinate systems in I\(K) and 
P l n (K) of which we have made use. 

To see this, we first note that the relation between S k and 27 n _ fc _ 1 
can be expressed as follows: the equation 

n 

£ UiX { = 0 
i-0 

is satisfied for every point (** 0 , of P n _ k _ v for all choices of 

(« 0 , ...,x n ) in S k , and conversely. Now, if we perform the trans- 
formation of coordinates . 

y — Ax 

in P r n (K), and the associated transformation 

v = uA~ l 

in P l n (K), then (noting that u, v are both lx(»+l) matrices) 

n n 

£ v iVi = vy = uA~ x Ax — ux = 2 u t x t = 0. 
o o 

Hence the property characterising the relation between 8 k and 
£ n _fc_i remains unchanged in the new associated pair of coordinate 
systems. 

The spaces 8 k and Z n _ k _ 1 are said to be dual spaces. 

If we take any n — k points of which form a basis for this 

space, and which have the coordinates 

«...,<) (i = 1, ...,n-k) 

in an allowable coordinate system in P l n (K), then the points of S k 
in P^iK) satisfy the equations 

£ufaj = 0 (i = 1 , ...,» — lc) ( 3 ) 

j -=0 

in the associated coordinate system. We may call (3) the equations 
ofS k . 

Given any set of equations 

= 0 (* = 1 , 2 ,...), ( 4 ) 

o 

we may consider the space in P l n {K) linearly dependent on 

«...,<) (*= 1 , 2 ,...). 

If this is a 2’ n _ Jfc _ 1 , the solutions of (4) are the points of the dual 
S k , and the equations therefore determine this S k . In algebraic 
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language, if the matrix of coefficients of a set of linear homogeneous 
equations (4) is of rank n — k, these equations determine a unique S k . 

Now consider two linear spaces 8 p and S Q in P^iK) with respec- 
tive equations n 

'Zujx^O (i = l,...,n-p), 


and 


Zvjxj = 0 (i = 1, ...,n-q). 


Suppose that the (2 n—p 


o 


- q) x (n + 1 ) matrix 

(u = (“«)» v = (v i} )) 


( 5 ) 


( 6 ) 


is of rank n — k. Then the two sets of equations, taken together, 
determine an S k . Each point of S k lies in S p and in 8 g , and each 
point common to 8 p and S q must lie in S k . We call S k the intersection 
of 8 p and 8 q , and have proved 

Theorem I. The intersection of two linear spaces is a linear space. 

If the matrix (5) is of rank n — k, the 2 n—p — q points 

«...,<) (i = 1, ...,n-p), 

(«o> •••>«&) {i = l,...,n-q) 

in P l n (K) are linearly dependent on n — k of them, and therefore the 
duals Z n _ p _! and E n _ q _ x of 8 p and 8 q lie in a Z n _ k _ x , which is the 
dual of S k . Moreover, there is no smaller space containing both 
P n -p-i an d P n -q- v For any space containing both of these spaces 
must contain the 2 n—p — q points (6), of which n — k are linearly 
independent. Calling the smallest linear space which contains two 
linear spaces their join, we have 

Theorem II. The dual of the intersection of two linear spaces is 
the join of their duals ( and conversely). 

As a corollary we deduce that 
If 8 p £> 8 q , then P n _ p _i £ 27„_ 3 _ 1 . 

We now find a relation between the dimensions of the intersection 
S k and the join 8 r of two linear spaces 8 p and 8 q . Let A 0 , ...,A k 
be a basis for the intersection. By the Exchange Theorem we can 
find bases P°, ...,P V for S p , and Q°, . . . , Q q for S q such that 

Then clearly, any space containing P°, ..., P p , Q k+1 , contains 
S p and 8 q , and therefore S r , their join. Therefore 

r^p + q-k. 
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A similar argument applies to the duals E n _ p _ x , Z n -q - 1 an d their 
join, which, by Th. II is E n _ k _ v Hence 

n — k — l^(n — p — l) + (n — q — 1) — (n — r— 1), 
that is, r^p + q — k. 

It follows that k + r = p + q. 

Hence we have 

Theorem III. If S p and S q meet in S k , their join is a linear space 
of dimension p + q — k. 

This theorem remains true when S p and S q have no intersection, 
provided that we then put k = — 1. 

We have established a correspondence between P^fK) and 
P l n (K) in which an S k of the former corresponds to a Z n _ k _ x of the 
latter, and intersections and joins correspond to joins and inter- 
sections respectively. If we establish any theorem for P^iK) 
which is based only on properties of linear spaces, their intersections 
and joins, this correspondence enables us to state forthwith a 
theorem for P l n {K), without further proof. For historical reasons 
this relation between theorems for I\(K) and for P l n {K) is called 
the Principle of Duality for projective space. 

If A is a field skew-isomorphic to K, there is a correspondence 
between P l n (K) and P r n {K) in which a linear space of k dimensions 
corresponds to a linear space of k dimensions, and intersections and 
joins correspond to intersections and joins respectively. Hence, by 
the Principle of Duality, if we have any theorem in P£(A) which 
concerns linear spaces S a ,S b , ..., their intersections and joins, we 
obtain a corresponding theorem in P^iK) concerning linear spaces 
S n - a -i> ••., their joins and intersections. If A is a com- 

mutative field we can take K = K, and the dual theorem is therefore 
true in P^K). Again, if the original theorem is true for all ground 
fields K, it is true in particular for P r n (K), and therefore the dual 
theorem is true in P’ifK). 

This last case is the most usual one. The theorems we shall prove 
will be either 

(i) true for all ground fields K, 
or (ii) true for commutative ground fields K, 
or (iii) true for all ground fields with a given characteristic, 

t -I tr \ ^mirt oil nrt i -i 4- o 4-i tto nr»/l o 1 r#oVvr*o •* r»o llxr nl/xaArl 
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It will be noted that if K belongs to one of these classes, the skew- 
isomorphic field K belongs to the same class. Hence, every theorem 
which we shall prove in P r n {K) will have a dual in P^K). 

6. Desargues’ Theorem. In this section we consider the 
celebrated Theorem of Desargues which deals with two triangles in 
perspective in P^K). In order to avoid needless repetition in the 
next chapter, we shall prove the theorem and its converse as con- 
sequences of the propositions of incidence [§4]. We notice here 
some immediate deductions from the propositions of incidence, 
which we shall use in this section. A more complete treatment will 
be found at the beginning of the next chapter. 

Every 8 X (or line) contains at least three distinct points. Two 
distinct points determine one line. Any S 2 (or plane) contains three 
linearly independent points. If a line l is determined by the points 
P°, P 1 , and a line m is determined by the points Q°, Q 1 , and the 
points P°, P 1 , Q°, Q 1 are linearly dependent, that is, the lines are 
coplanar, there is at least one point R on both lines. If P is any 
point in a plane n, at least three distinct lines can be drawn through 
P; for if P, Q, R is a set of linearly independent points in n, the line 
QR contains at least one point different from Q and R. Let this be 
S. Then PQ, PR, PS are three distinct lines through P. To complete 
the deductions from the propositions of incidence for a plane we 
show that if A, B, U are three distinct collinear points in n, we can 
draw lines a, b, u through A, B, U respectively which lie in n, are 
distinct from the line A U, and do not all pass through one point. 
For there is at least one point X inn such that X, A, U are linearly 
independent. The line XU contains a point Y distinct from X and U. 
The lines a = AX, b — BY, u = UX satisfy the conditions. 

The theorems we shall use in S 3 are: a line meets a plane in one 
point, or it lies in the plane; two planes in S 3 meet in a line; at least 
three distinct planes pass through a given line. 

We now return to the consideration of Desargues’ Theorem. 

In order that we may avoid the consideration of a large number 
of special cases it will be useful to use the terms collinear and 
concurrent in the following sense: three points are collinear if there 
is at least one line containing them (if the points coincide there may 
be more than one fine); three lines are concurrent if there is at least 
one point lying on all three of them. We shall use the word triangle, 
or the term proper triangle, to denote a triad of points which are 
not collinear. 
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Two triangles ABC, A' B'C' are said to be in perspective from 0 
if there exists a point 0 such that OAA' , OBB' and OCC' are each 
collinear triads. We note that 0 need not be unique as, for example, 
when A — A', and B = B' . With these definitions Desargues* 
Theorem is 

Theorem I. If two triangles ABC, A' B'C' are in perspective 
there exists a line l passing through a point common to BC and B'C', 
a point common to CA and C'A', and a point common to AB and A' B' . 

The converse theorem is 

Theorem II. If ABC, A' B'C' are two triangles such that there is 
a line l passing through a point common to BC and B'C' , a point 
common to CA and C'A' , and a point common to AB and A'B' , then 
the triangles are in perspective. 

We remark that in both theorems the whole configuration lies 
in a linear space of either two or three dimensions. (Since A, B, C, 
are not collinear the space cannot be of less than two dimensions.) 
For in Theorem I there must be three lines a, b, c through O con- 
taining, respectively, AA', BB' , CC' . On these lines take three 
points P, Q, R different from O. Then the set O, P, Q, R determines 
an S r (r < 3) which contains O, P, Q, R and therefore 0, A, A', 
B, B', C,C', and hence the whole configuration. The line l of the 
theorem lies in this S r . We also note that if r = 3 the two planes 
determined by A, B, C and A', B' , C' are necessarily distinct, 
except in the trivial case A = A' , B = B' , C = C' . 

In Theorem II the triangles ABC, A' B'C' each determine a plane. 
Let these be tt and tt' . The fine l of the theorem meets BC, CA and 
AB, and, since A , B, C are not collinear, at least two of these inter- 
sections are distinct. Hence l has at least two points in tt, and there- 
fore lies in n. Similarly l lies in tt'. The planes n, tt' having at least 
a line in common, their join is an S r (r ^ 3). The point O must lie 
in this S r , except in the trivial case A = A' , B = B' , C = C' . 

We may therefore confine ourselves to the two cases (i) r = 3 and 
(ii) r = 2. In case (i) the planes tt and tt' determined by A, B,C 
and A' , B' , C' are distinct, in case (ii) they are not. 

Theorem I, Case (i). The lines a, b, c containing OAA ' , OBB' , OCC' 
are distinct, otherwise they would all lie in a plane. Hence any two 
of them determine a plane. The points B, C, B' , C' all lie in the plane 
determined by b and c. Therefore BC and B'C' have at least one 
point P in common. Similarly there is a point Q common to CA 
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and G'A', and a point R common to AB and A'B'. These three 
points P, Q, R all lie in 77 and 77'. Now, 77 and tt ' , being planes in S a , 
meet in a line, and this is a line l satisfying the theorem. This line 
l is unique. Indeed, if there existed a line V different from l satisfying 
the conditions of the theorem, l' could not lie both in 77 and in Tt' . 
Suppose it did not lie in 77. Then it would meet 77 in a point common 
to BC, GA, AB, which is impossible, since A, B, C are not collinear. 

Theorem II, Case (i). The line l must be the intersection of tt 
and n'. Since BG and B'G' have a point in common, B, B' , G, C 
are linearly dependent, and hence BB' and GC have at least one 
point in common. If they have more than one point in common the 
four points are collinear, and must therefore lie on the intersection 
l of 77 and 77'. In this latter case A and A' cannot lie on l. Since AB 
and A'B' meet, we must have B = B'. Similarly G = C', and there- 
fore the triangles have two pairs of corresponding vertices coinciding. 
When two pairs of corresponding vertices of the triangles ABC, 
A' B'G' coincide, the triangles are clearly in perspective from any 
point on the join of the other two vertices. 

The case in which just one pair of corresponding vertices, say 
A and A', coincide is also easily disposed of. Then BB' and GC' 
meet, but do not coincide, and their intersection O is a point from 
which the two triangles are in perspective. 

Finally, we have the case where no two corresponding vertices 
coincide, and therefore the lines AA', BB', GC are each uniquely 
defined, and each has a single point in common with each of the 
other two. Let O be the point common to BB' and CC'. If O does 
not he on AA' , the points 0, A, A' determine a plane containing 
.4-4', BB' and CC, and therefore tt = tt ' , contrary to hypothesis. 
Therefore AA' passes through O, and the triangles ABC, A'B'C' 
are in perspective from O. 

We now turn to Case (ii), when the two triangles lie in a plane. It 
is desirable to deal, first of all, with certain special cases which may 
arise. 

Theorem I. If all three pairs, A and A', B and B', C and C 
coincide, the theorem is trivial. If two coincide, say B = B' and 
G = C', the fine BG satisfies the theorem. If only one pair coincide, 
say A — A', the line joining this to any point common to BG and 
B'C (there is at least one) satisfies the theorem. Again, if the lines 
BC, B'C coincide without any pairs of corresponding vertices 
coinciding, BC and B'C' are the only pair of corresponding sides 
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which coincide. Hence AB intersects A'B' in just one point R, CA 
intersects C'A' in just one point Q, and the line QR satisfies the 
theorem. 

Next, suppose that the corresponding vertices A, A'\ B, B'; 
C, C 1 are distinct, but that O coincides with at least one of the 
vertices, say A. Then B'C' satisfies all the requirements of the 
theorem. This is also true if a non-corresponding vertex, B' say, 
coincides with A and with O. 

Finally, if two vertices, say B and C’, coincide, but not in 0, 
the lines BG, B'C’ must coincide, and we are back at a case already 
examined. 

There only remains the case in which the seven points A, B, C, 
A’, B', C, O are all distinct, and corresponding sides, such as BG, 
B’C are also distinct. We call this the general case. We now consider 
the various special cases which may arise in proving Theorem II. 

Theorem II. We consider the possibility of the pairs A, A'; 
B, B'; G, C coinciding. If all three pairs coincide, the theorem is 
trivial. If two pairs coincide, say B = B', C = C' , A =j=d', the 
triangles are in perspective from any point on AA'. If only one pair 
coincide, say A = A', Bj= B', G 4= C, the triangles are in perspective 
from the point (any point) common to BB' and GC'. Again, if 
two corresponding sides coincide, say BG = B’C, the triangles are 
in perspective from a point on this line collinear with A and A'. 

Next, suppose that l coincides with BG. Then C'A' must pass 
through C, and B'A' must pass through B. The triangles are 
therefore in perspective from A'. Finally, if two non-corresponding 
sides coincide, say BC = C'A', but not in l, we must have G = C' , 
and we return to a case considered above. 

The only remaining case to be considered in proving this theorem 
is therefore the general case in which the seven fines BC, CA, AB, 
B'C', C'A' , A'B', l are all distinct, and the pairs of points A, A'; 
B, B'; C, C' are also distinct. 

We note that all the special cases considered above illustrate the 
two theorems, the proofs following directly from the propositions 
of incidence. We now prove the two theorems in the general case 
on the assumption that n> 2, that is, that there exists a point not in 
the plane n — it' . 

Theorem I (general case). Since there exists a point not in it, there 
exists a fine A through 0 which is not in n. Take two distinct points 
V and V on A, distinct from 0. The fines VV, AA' meet in 0, and 
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therefore lie in a plane. They do not coincide, because VV = A 
does not lie in tt. Hence VA, V'A ' have a unique point A * in common. 
Since A 4= A' (for we are in the general case) and VA is not in n, 
.4* is not in 7r. Nor is it on A, since this would imply A = OoiA' = O. 
Similarly VB, V'B' have a unique point B* in common, and VC, 
V'C have a unique point C* in common. The points A*, B*, C* do 
not lie in tt, and they are distinct from V and V'. If A*, B*, C* were 
linearly dependent, the points V, A*, B*, C* would lie in a plane 
which would be distinct from n. But this plane would contain 
A, B, C, and these would therefore be collinear points, contrary to 
hypothesis. 

Hence ABC, A*B*C * are two triangles in perspective from V. 
Therefore BC passes through the point P common to B*C* and tt, 
CA passes through the point Q common to C*A* and n, and AB 
passes through the point R common to A*B* and tt. 

Similarly A'B'C’ and A*B*C* are in perspective from V, and 
we deduce in the same way that B'C' , C'A' and A'B' pass through 
P, Q, R respectively. But P, Q, R are on the intersection l of tt 
with the plane of A*B*C*. This line l therefore satisfies the require- 
ments of the theorem. 

Theorem II ( general case). Through l draw a plane 7r' distinct from tt. 
P, Q, R are the intersections of BC, CA, AB respectively with l. 
If Q = R, then we must have A = A', which is not the case (since 
we are dealing with the general case). Hence P, Q, R are all different, 
and we can draw through them three lines in tt' to form a triangle 
A*B*C*. By the converse of the three-dimensional Desargues’ 
Theorem, ABC and A*B*C* are in perspective from some point V, 
and similarly A'B'C' and A*B*C* are in perspective from some 
point V'. If V = V we should also have V = V = A* = B*= C*, 
whereas the points A*, B*, C* are constructed so as to be distinct. 
Again, neither V nor V lies in tt, for if either did, we should have 
tt' = tt. The line VV is therefore uniquely determined, and meets 
tt in one point 0. Since VA and VA' meet in A*, VV and AA' 
meet in 0. Similarly BB’ and CC' pass through O. 

Thus we have proved Desargues’ Theorem and its converse, using 
only the propositions of incidence, except in the general case in two 
dimensions. We then had to make the additional assumption that 
the plane is contained in a space of higher dimension. This additional 
assumption can easily be justified for a space defined over a field K 
by the method of § 2, and we shall give a simple algebraic proof of 
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Desargues’ Theorem in this case. But we first remark that assuming 
Desargues’ Theorem in the plane, the converse follows without any 
additional assumption. We need only consider the general case. 

Let BC, B'C' meet in P, CA, O' A' in Q, and AB, A' B' in R. 
In the general case neither BB'R nor GC'Q can be collinear triads. 
We consider the two triangles BB'R, CC'Q, and we see that they 
are in perspective from P. Now, B'R and C'Q meet only in A', and 
RB and QC meet only in A. Hence, by Desargues’ Theorem in the 
plane, BB' and CC' meet on AA' . That is, the triangles ABC, 
A' B'C' are in perspective. 

We conclude this section with a simple algebraic proof of Theorem 
I which is valid as long as 0 is distinct from the vertices of at least 
one of the triangles. Let this triangle be ABC. We use the symbolic 
notation introduced in § 4. 

Since A' is on OA, and is assumed to be distinct from A, we may 
choose the multipliers of A’, B', C' so that 

A' = O + Aa, 

and similarly B' = 0 + BJ3, 

and C' = 0 + Cy, 

where a, ft, y are in K. Then 

P=B'-C' = Bfi-Cy 

is a point on B'C' and on BC. 

Similarly, <j = C ’ -A' - Cy-Aa 

is on C'A' and on CA, and 

R = A'-B' = Act- Bp 
is on A' B' and on AB. But 

P + Q + R — 0. 

Therefore P, Q, R are collinear. This proves the theorem. 

7. Some fundamental constructions. We now discuss some 
constructions which are of fundamental importance in the next 
chapter. They are valid in any P r n {K) for which n^2. 

Let 0 and U be any two distinct points in P r n (K). We suppose 
that the multipliers of these points are fixed, and we define a point 
E, which is distinct from 0 and U, by the equation E = O+U. 
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Let A, B be two points of OU which are both distinct from U. We 
may suppose that their multipliers are chosen so that 

A = 0+Ua, B = 0+ Ufi. 

We propose to show how to obtain the points 
0+ U(a + /3), 0+ Uafi 

by geometrical construction. In each case the construction will 
be given in italics, and the justification for the construction will 
follow. 

I. A construction for 0+U(a + fi). In any plane through the line 
OU draw three lines a, b, u through A, B, U, respectively . These lines 
are distinct from OU and do not all pass through one point. Let M be 
the intersection of u and a, B' the intersection of u and b, A' the inter- 
section of OB' and a, and L the intersection of U A’ and b. The inter- 
section P of LM and OU is the point 0+U(a + /3). [See Plate I.] 

By Proposition of Incidence V, if n is any plane through the line 
OU there exists a point X in n which is independent of O and U. 
The point Y = U + X is different from X, and does not lie on OU 
or on AX. Hence the lines AX, BY, UX are three fines through 
A, B, U in 7T which are each distinct from OU, and do not all pass 
through one point. Hence the first part of the construction, the 
selection of the fines a, b, u, is possible.* The points M, B' are 
uniquely defined. So also is the point A'. For if OB' intersected a 
in more than one point, we should have OB’ = AM = a. Then 
A,M,0,B' would be colfinear, that is, MB' would contain 0, and we 
should have u — OU, contrary to hypothesis. Similarly, the point L 
is uniquely defined. The points L, M can only coincide if both are at 
the intersection of a and b. But M is on u, and u does not pass 
through the intersection of a and b. Hence the fine LM is uniquely 
defined. This fine cannot coincide with OU, for M would then 
coincide with A, and we should have u = OU. The intersection P 
of LM and OU is therefore a unique point. We now show that 
P = O+U(a + 0). 

This is evident when one or both of the points A, B coincides 
with O, and we may therefore confine our attention to the case 
represented by the first two figures. In these cases A' is easily 

* This was proved in § 6, using only the propositions of incidence. Here 
we merely select the point Y on UX. 





7. SOME FUNDAMENTAL CONSTRUCTIONS 199 

shown to be distinct from A and M. Hence, in writing the 
equation 

M\ + A'/i+Av = 0 

which expresses the collinearity of M, A', A, the multipliers A and 
v are different from zero, and we may choose the multipliers of A 
and M so that 

M = A + A'/i = 0+ Ua + A'/i. 

Hence M — Ua = O + A'/i is on MU and on OA', and is therefore 
the point B'. We define the symbol for B' by the equation 

B' = O + A'/i. 

Now B' — B = O + A'/i— O— Ufi — A'y — Ufi, and therefore the 
point L, which is on A'U and B’B has the symbol 

A’fi-Ufi = B'-B 

Hen ce 

M - L = A + A > - A > + Up = A + Up = O + U(a + p) 

is the symbol of a point on LM and on OU. This point is P, and we 
have therefore proved that P = 0+ U(oc + f). 

II. A construction for 0+ Ua.fi. In any plane through the line 
OEU draw three lines a, b, u through A, B, U , respectively. These lines 
are distinct from OU and do not all pass through one paint. Let P be 
the intersection of u and a, Y the intersection of u and b, X the inter- 
section of EY and a, Q the intersection of OX and b. Then PQ meets 
OU in the point R = 0+ Ua.(l. 

By definition E = 0+U, and it is therefore different from 0 and 
U. As above, the selection of the fines a, b, u is possible. The points 
P, Y are distinct, and neither lies on OU. The intersection X of EY 
and a is unique. For otherwise EY = a, and Y would coincide with 
the intersection of a and b. But Y is on u, and o, b, u are not con- 
current. Again, X is distinct from O, since X is on EY, which does 
not coincide with OU, Y not lying on OU. Hence OX is defined. 
The point Q, the intersection of OX and b, is unique. For otherwise 
OX = b and we should have, in the first place, B = O. If B =f= O, we 
have a contradiction. If B = 0, and OX = b, we can still obtain 
a contradiction. For X and Y would fie on b, and since XY cuts 
OU in E, and b is distinct from OU, we should have E — B = O, 
contrary to hypothesis. Hence Q is unique. If P coincides with Q, 
it can only be at the intersection of a and b. Since P is on u, this 
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would contradict the fact that a, b, u are not concurrent. Hence the 
line PQ is defined, and since P is not on OU the line PQ meets OU 
in a unique point R. We now prove that R = 0+ Ua.fi. 



[Plate II shows the possible special cases.] 

The proof is again trivial in the cases in which A or B coincides 
with 0 or E, and it will simplify matters if we confine ourselves to 
the cases in which A and B are different from O or E. When A is not 
at E the points P, A and X are all different. In the first place P is 
on u and not at U, hence it is not on OU, and therefore cannot 
coincide with A. If P = X , then E, P, Y are collinear. But PY is u 
which does not contain E, and we have a contradiction. Finally, 
since X is on a and on EY, it can only coincide with A when A = E, 
which is contrary to our hypothesis. Hence in the relation 

PX + A /1 + Xv = 0 

which expresses the collinearity of P, A, X, the multipliers A, ji, v 
are all different from zero. 

Now if B is not at O, fi is not zero, and it follows that we can 
choose the multipliers of P and X so that 

P = Afi + X. 

Then P+U(l—a)fi=(0+Ua)fi+U(l—a)fi+X 

= (0+U)fi+X 
= Efi + X ; 

it follows that this is the symbol of a point on PU and on EX. 
Hence we may write 


Y= (0+U)fi+X. 


PLATE II 
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Then Y-B = (0+ U)p+X-0- Up 

= X+0(fi— 1); 

we have a point on YB and onlO which is therefore Q. Finally, 
P-Q = Ap+X-X-0(P~ 1) 

= 0 + Ua.p ; 

that is, the intersection of PQ and OU, namely R, is represented 
by the symbol 0+ Ua($. 

8. The condition for a commutative field ; Pappus* Theorem. 

The construction given in the last section for the point R = O+Ua/i 
is not symmetric in A and B, and if the roles of these two points 
are interchanged, the point R' = 0+ U Pol is obtained. This is dis- 
tinct from R, unless fia = OLp. In all the various cases considered 
in the last section, except in the first or ‘general’ case, the con- 
dition aj3 = Pol is satisfied; for either a = P, or one or both of a, P 
is zero or unity. We therefore confine ourselves to the first case. 



With the same figure as for Fig. 1 , let OF meet a in P' , and let 
XU meet b in Q'. Then P'Q' meets OU in R' = 0+ U Pol. Hence the 
condition that olP — p<x is the condition that the two collinear triads 
P'PX and QQ'Y are such that the intersection 0 of P'Y, QX and 
the intersection U of XQ', YP should be collinear with the inter- 
section of PQ and P'Q'. We can express this more briefly by saying 
that the intersections of the cross-joins of the two triads PP'X and 


P P' x 



O' Q T 


Q'QY are collinear. 
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We consider this theorem by itself, altering the notation. Let 
l, V be two coplanar lines, and let ABC and A' B'C' be two triads 
of points on l, l' respectively. Let P, Q, R denote the intersections 
of their cross-joins BC', B'C; CA', C'A ; AB', A'B. 

It can at once be verified that if the three points of either triad 
are not distinct the points P, Q, R, as long as they are determinate, 
are collinear. On the other hand, if one or more of the points P, Q, R 
is indeterminate, there is at least one line containing a point P 
common to BC' and B'C, a point Q common to CA' and C'A, and 
a point R common to A B' and A ' B. The theorem can also be proved 
at once when any one of the six points A, B, C, A', B' , C' coincides 
with 0, the intersection of l and l'. We notice further that if P, Q, R 
are not distinct, A, B, C, A', B', C' cannot all be distinct and 
distinct from 0. If, for instance, Q = R, and A 4= A', we must have 
B — C, B' = C', in which case P is indeterminate, or A = B = C, 
with P = Q = R = A. 

If ABC, A' B'C' are two triads of points on two coplanar lines 
l and V and the intersections of the cross-joins are collinear, we say 
that Pappus' Theorem is satisfied. We have seen that Pappus’ 
Theorem is always satisfied in the case when either triad contains 
a pair of coincident points, or a point coinciding with the inter- 
section O of l and V, and we need only consider the theorem when 
none of these special cases arises. 

We can prove Pappus’ Theorem for the case in which the fines 
AA’, BB’ , GC have a point in common. We apply Desargues’ 
Theorem to the triangles A'BC, A B'C'. We deduce that the points 
0, Q, R are collinear. Similarly, by considering the triangles A B'C 
and A'BC' we deduce that P, O, R are collinear. By hypothesis 
these are triads of distinct points. It follows that P, Q, R, 0 are 
collinear. 

On the other hand, if the ground field K is not commutative 
Pappus’ Theorem cannot be true for all pairs of triads. For there 
must exist a pair of elements a, fi in K such that afi 4= ficc. We per- 
form the construction (Fig. 2) given above for R = 0 + Ua.fi and 
R' = 0 + U fia. As we saw above, if Pappus’ Theorem is true for 
the triads PP'X, Q'QY, then R = R' and therefore afi = fia, 
contrary to hypothesis. We conclude that if Pappus’ Theorem is 
universally true in P r n {K), then A is a commutative field. 

We now show, conversely, that if the ground field K is com- 
mutative, Pappus’ Theorem is universally true in P r n {K). This 
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can be done by showing that any two triads can be taken as PP'X, 
Q'QY for a suitable Fig. 2, but the following proof is simpler. As 
explained above, we need only consider the case in which A, B, G, 
A', B', C’ are all distinct, and distinct from the intersection O of 
l and V. We denote the intersections of the cross-joins by P, Q, R. 



Then, since the special cases are excluded, P, Q, R are all distinct. 
If QR passes through 0, the converse of Desargues’ Theorem shows 
that the triads A’BC, AB’C are in perspective. Hence, as above, 
Pappus’ Theorem holds for the triads ABC, A'B’C’. We assume 
then that QR does not pass through 0, and cuts l in X, V in Y’ . It 
can be immediately verified that since A, B, C are distinct, and 
since also A', B', C are distinct, the point X is distinct from points 
of the first triad, and Y' is distinct from points of the second triad. 
We may therefore choose the multipliers of 0, X, Y’ so that 

A = 0 + X, B' = 0 + Y'. 

Since R is on AB' and on XY' , 

R = A-B' = X-Y'. 

Now, if B = O + X/3, A' = 0+ TV, 

A'B meets XY' in Xfi— Y'a,'. Since this is the point R, we must 
have a' = /?. 
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Again, if C = 0 + Xy, V = 0 + Y'y’, 

CA' meets XY'inQ = Xy- Y'a'. But C'A meets XY' in A - Y'y'. 
Since this is the point Q, we must have y'y = a'. 

Now consider the intersection P of BC' and B'G. The line BC' 
meets X Y' in the point Xa' — Y'y', and B'G meets X Y' in the point 
Xy — Y'. The point P is on X Y', that is, Pappus’ Theorem is true, 
if and only if Xa.'— Y'y' = (Xy - Y')y' , 

that is, if and only if , , , 

J yy' = a = y y. 

But this is always the case, since K is commutative. Hence we have 
proved 

Theorem I. A necessary and sufficient condition that Pappus’ 
Theorem be universally true in P^,[K) is that K be commutative. 

There are, of course, other geometrical theorems equivalent to 
the condition that K be commutative. But since Pappus’ Theorem 
is one of the simplest of these, and since it is traditionally taken as 
the geometric equivalent of the assumption of commutativity, we 
take it as our geometric criterion. 

9. Some finite geometries. The constructions given in §7 
were possible because we were able to draw a sufficient number of 
distinct lines through a given point to make the various inter- 
sections determinate. If the ground field K contains an infinite 
number of elements, we can draw as many distinct fines through a 
given point O as we please. For in any plane through O there exist 
two points, X, Y such that 0, X, Y are linearly independent. The 
fines joining 0 to X + Ya, where a ranges over K, are infinite in 
number, and we can always select from these a finite number which 
satisfy our requirements. 

If A is a finite field containing k elements (including 0 and 1) 
the line XY contains the k points X+Ya (a in K) and also the 
point Y. Through 0 we can therefore draw exactly k + 1 fines 
(& > 2); that is we can draw at least three distinct fines through O in 
the given plane. As we saw in § 7 this is a sufficient number to make 
the constructions given there determinate. 

The plane through 0 contains at least seven points, namely 

0, X, Y, O + X, O+Y, X+Y, O + X+Y. 

If K is the field of integers reduced modulo 2, in which case K 
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contains only the elements 0, 1, these seven points are the only 
points in the plane. Then, since 

O+X + X+Y+O+Y = 2(O + X+Y) = 0, 

the three points O + X, X+Y, 0+ Y are collinear. We shall return 
to this finite geometry of seven points in the next chapter. 


o 



x X+Y Y 


Fig. 4. 

The number of points in P r n ( K ), where K is the field containing 
only the elements 0, 1 , is 2 TC+1 — 1 . For in a given allowable coordinate 
system these are the points (a 0 , a„), where each a i takes the 
values 0, 1 independently, only the case in which every <x i is zero 
being excluded. The number of points in P l n (K) is the same, since 

w = nm- 

In the same way we can show that if K is the field of integers 
reduced modulo p, so that K contains only the elements 

0, 1, ..., p-1, 

the number of points in P r n {K) = P l n {K) is 

pn+l _ 1 

~P~ 1 

10. r - way spaces. Many generalisations of P'^K) are possible. 
Only one will be used in this work. The ground field K is here taken 
to be commutative. 

We consider r projective number spaces 
PN ni (K) (i = 1, 

and suppose that 

(4 4*) (* = l,...,r) 
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are the coordinates in allowable coordinate systems of points in 
these r spaces. Now consider an ordered association of these points 

* = 

These are called the points of the r-way projective number space 

PN ni ...nr(K). 

An allowable transformation of PN ni nr (K) is defined to corre- 
spond to a projective transformation of PN ni (K), a projective trans- 
formation of PN nt (K ), ..., a projective transformation of PN nr (K). 

We consider a set S of elements which can be put in one-to-one 
correspondence with the points of PN ni ^(K). This correspondence 
establishes a coordinate system in S. The allowable transformations 
of PN ni nr (K) lead to new correspondences, and hence to new 
coordinate systems in S. The elements of S, taken together with 
these allowable coordinate systems, constitute the points of r-way 
projective space P ni nr {K) . In fact P ni nr (K) is simply an aggregate 
in one-to-one correspondence with all the sets of points (A v ...,A r ) t 
where A v ...,A r are arbitrary points of P n jK), ...,P nr (K), the 
coordinate systems being obtained by associating allowable 
coordinate systems in each of these spaces. 



CHAPTER VI* 


PROJECTIVE SPACE: SYNTHETIC 
DEFINITION 

1. Propositions of incidence. In Chapter V we defined a 
projective- space of n dimensions over a field K, and derived [V, § 4] 
certain theorems which we called the Propositions of Incidence. 
The object of this chapter is to consider to what extent these 
propositions characterise a projective space. 

We begin by postulating the existence of certain objects which 
we call linear spaces of 0,1,2,... dimensions (or points, lines, 
planes, ...) and a relation which may exist between a linear space 
S p of p dimensions and a linear space S q of q dimensions. If this 
relation connects S p and S q , we say that S p lies in S q , and write 

8 p z8 q . 

This relation is also written 

S q =>S p , 

and is then read ‘ S q contains S p \ 

If P 0 , ...,P p are any p+ 1 points we say that they are linearly 
dependent if there is a linear space S q of q dimensions such that 

Pi^S a {i = 0,...,p), 

where q <p. Otherwise the points are said to be linearly independent. 

We now postulate the following properties for the incidence 
relation £ [cf. V, §4]. 

I. If S h c S k and 3 k <=. S h then S k = S h . 

II. If Sp c S a and S q c S,, then S p c S r . 

III. Every fine contains at least three distinct points. 

IV. Given any p + 1 linearly independent points, there is at 
least one linear space of p dimensions which contains them. 

V. Any linear space of p dimensions contains at least one set of 
p + 1 linearly independent points. 

VI. If P 0 , ...,P p are p+ 1 linearly independent points which lie 
in an S q , any S p containing them is contained in S q . 

* This chapter is almost completely independent of the rest of the book, 
and may be omitted at a first reading. 
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VII. If P 0 , ..., P p are p + \ linearly independent points of an 8 p , 
and Q 0 ,...,Q q are q + 1 linearly independent points of an 8 q , and if 
the p + q + 2 points P 0 , ..., P p , Q 0 ,...,Q q are linearly dependent, 
there exists at least one point R which lies both in S p and 
mS q . 

VIII. There exists a finite integer n > 1 such that there is at 
least one set of n + 1 linearly independent points, but any set of m 
points, where m > n + 1, is linearly dependent. 

That these are a set of consistent postulates is proved by the 
example of a projective space, defined in the previous chapter. 

We now draw certain conclusions about the relation £ , about 
linear dependence, and, later on, about the intersections of linear 
spaces. 

(1) Each linear space S h lies in itself. For by V, 8 h contains h + 1 
linearly independent points. By VI, any S' h containing these points 
lies in S h . Choose S' h = S h , and the result follows. 

(2) If S p £ S q , then p^q. For by V there is a set of p + 1 linearly 
independent points lying in 8 P . By II these lie in S q . If q<p, 
these points would be linearly dependent. Hence p < q. 

(3) If S p £ S' p , then S p = S' p . For if P 0 , ...,P p are p + 1 linearly 
independent points of S p , by II they are p + 1 linearly independent 
points of S'p. Hence, by VI, S p £ S p , and therefore, by I, S p = S' p . 

Corollary. The S p containing p + 1 linearly independent points 
is unique, and any set of p + 1 independent points of an S p serves 
to determine it. 

We shall call any set of p + 1 linearly independent points in an 
S p a basis for the S p . 

(4) Let P 0 , ...,P p be p+ 1 independent points, and suppose that 
P 0 ,...,P S is a linearly dependent subset. Then P 0 , ...,P S lie in some 
S t , where t<s. Let Q 0 ,...,Q t be a basis for S ( , and consider the 
q+1 = p-8 + t+l points Q q ,...,Q t , P s+v ...,P p . If these are 
linearly independent they determine an S q . In any case, they lie 
in an S r , where r^q<p. Now, S r contains Q 0 , ...,Q t and hence, by 
VI, S t . Therefore, by II, S r contains Po,-,P a . Also, S r contains 
^s+i> ••• >Pp • Hence Po, -,Pp are contained in S r ( r<p ) and are 
therefore linearly dependent. From this contradiction we deduce 
that any subset of a linearly independent set of points is necessarily a 
linearly independent set. 

Corollary. The points of a linearly independent set are all 
distinct. 


HP I 


14 
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(6) Let P 0 P p be a basis for S p \ Q 0 ,...,Q q a basis for S q . If 

S p and S q have no common point it follows from VII that P 0 , P p , 
Qo> •••>Qq are linearly independent. Suppose that S p and S Q have 
at least one point in common. Considering the set of common points 
there exists an integer r (0 < r < n) such that there is at least one 
set A 0 ,...,A r of r+1 linearly independent points common to S p 
and S q , but any set of m points (m>r+ 1) common to S p and S Q 

must be linearly dependent. Now, A 0 A r determine an S r . 

Since A^S p (i = 0, ...,r), it follows from VI that S r <=S p . Simi- 
larly S r £ S q . Hence r < min (p, q). Also, if the point R lies in S r , 
R^S r ^ S p , and therefore, by II, R £ S p . Similarly R £ S q . On the 
other hand, if the point B lies in S p and in S q , the points A 0 , ...,A r , 
B are linearly dependent, and therefore lie in an S t (t^r). Since 
A t ^S t (i = 0, ...,r), S r £ S t and therefore r^t. Hence r = t and 
S r = S t . Therefore the points common to S p and S Q are all the points 
of a linear space S r . If now S t is any space lying both in S p and in 
S q , and C 0 , ..., C t is a basis for S t , then the points of this basis lie in 
S p and in S q and therefore in S r . Hence S t lies in S r . 

(6) With the notation of (5) we prove the converse of VII, that 
if S p and S g have at least one point (and therefore all the points of 
an S r ) in oommon, the p + q + 2 points P 0 ,...,P p , Q 0 ,...,Q q are 
linearly dependent. If p > r there exists a point in S p which is not 
in S r . Otherwise S p £ S r , and hence p^r. Let B r+1 be a point which 
is in S p but not in S r . Then A 0 , ...,A r , B r+1 must be linearly in- 
dependent. Otherwise we could show, as above, that B r+1 is in S r . 

Hence A 0 A r , B r+l determine an S r+1 in S p . If p > r + 1 we can, 

similarly, find another point in S p , B r+2 , say, such that A 0 , ...,A r , 
j B r+1 , B r+ 2 are linearly independent. If we continue in this way we 
obtain a basis A 0 ,..., A r , B r+1 , . . . , B p for S p whose first r+1 points 
are a basis for S r . In the same way we can construct a basis A 0 , A r , 
C r+1 , ...,G q for S q . If the points A 0 ,...,A r , B r+V ...,B p , C r+1 , ...,C q 
are linearly independent, they determine a linear space oip + q — r 
dimensions. In any case they determine an S t (t^p + q — r). Since 
S t contains A 0 ,...,A r , B r+l , ..., B p , it contains S p , and therefore, 
by II, P 0 ,...,P p . Similarly it contains Q 0 ,...,Q q . Hence the set 
P 0 , ...,P p ,Q 0 , ...,Q 4 liesin S t (t^p + q-r<p + q+ 1) and is therefore 
linearly dependent. 

(7) We now prove that in fact the points A 0 , ...,A r , B r+1 , ..., B p , 
C r+1 , ...,C q are linearly independent. For, let us suppose that they 
are linearly dependent. By construction, A 0 , ...,A r , B r+1 , ...,B p 
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are linearly independent and determine S p . Also, C r+1 ,...,C q is a 
subset of a linearly independent set. By (4) above this subset is a 
linearly independent set, and it therefore determines an S q _r_ v 
By VII, S p and have a point, R, say, in common. Since 

C t £ S q (i = r+ 1, $ 8 _ r _ x £ S q , and therefore R lies in S q and 

in S p . Hence, by (5) above, it is in S r . Since S r and $ 8 _ r _ x have the 
point R in common, the points A 0 ,...,A r , C r+V ...,C g must be 
linearly dependent, by (6) above. This contradiction proves that 
A 0 , ...,A r , B r+1 , ..., B p , C r+1 , ..., C 8 are linearly independent. These 
points therefore determine an S p+q _ r , the result still holding, as at 
the beginning of (5) above, if S p and S q have no point in common, 
provided that we put r = — 1. Clearly S p £ S p+Q _ r , and S q £ S p+q _ r . 
On the other hand, if S k is any linear space such that S p £ S k , and 
also S q ^S k , then A 0 , ...,A r , B r+1 , ..., B p , C r+V ...,G q are all in S k . 
Therefore S p+Q _ r ^ S k . Finally then, S p+q _,. is contained in every 
linear space which contains both S p and S q . If we call this space the 
join of S p and S q , the space S r being called the intersection of S p 
and S q , we have proved 


Theorem I. The dimension r of the intersection of two linear 
spaces S p and S q is connected with the dimension t of their join by the 

relation „ . , 

r + t=p + q. 


We know, by VIII, that there exists a set of n + 1 linearly in- 
dependent points. Let these be A 0 ,...,A n . They define an S n . 
If R is any other point, then, again by VIII, A 0 , ...,A n , R are linearly 
dependent, and therefore lie in an S m (m^n). Since A i ^S m 
( i = 0, ...,»), it follows that S n £ S m and therefore, by (1) above, 
n < m. Hence S m = S n , and all points (and similarly all lines, 
planes, ...) lie in S n . The complete configuration of points, lines, 
planes, . . . lying in S n is said to form an incidence space of n 
dimensions. 

A set of n + 1 linearly independent points in S n is called a simplex. 
Let A 0 ,...,A n be a simplex. The points A 0 , ...,A f _ x , A i+1 , ...,A n 
are linearly independent, and determine an S n _ t (or prime) E i , 
which we call a face of the simplex. 

For the sake of later applications [§ 6] we now prove that there is 
a point E which does not lie in any face of the simplex A 0 , ...,A n . 
We prove this by induction on n. If n = 1, the result is simply 
Postulate III. We therefore suppose that we have proved the 
theorem for a space of n — 1 dimensions, in particular for 27°, the 

X4-2 
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face of the simplex which does not contain A 0 . Let E 12n be a 
point of 2° which does not he in any face of the simplex A v ...,A n 
of that space. Since any point common to 27° and 27* ( i > 0) lies in 
a face of this simplex, E n n does not he in any 27* (i > 0). Since 
A 0 is not in 27°, the points A 0 and E n n must be hnearly indepen- 
dent. They determine, in consequence, a hne l 0 . This hne cannot 
lie in 27°, since A 0 is not in 27°; nor in 27* (i = 1, since E 12 n 
does not he in 27*. The hne meets 27* (i = 0, ...,») in unique points, 
namely, in the points E 12 n , A 0 , ...,A 0 . Now, l 0 contains at least 
one point E distinct from A 0 and E 12 n , and this point satisfies 
our requirements. 

Now let i 0 , . . . , i n be any derangement of the integers 0, . . . , n, and 
consider the points A ik+i , ...,A in (k ^ 0). These are hnearly in- 
dependent. If, therefore, the set A ik+i , ...,A in , E was hnearly 
dependent, E would he in the space determined by A ik+1 , ...,A in , 
and therefore in 27*'°. This is contrary to hypothesis. The n — k+l 
points considered therefore determine an S n _ k . Now, A io ,...,A ik 
determine an S k , and since the join of S k and S n _ k is the whole 
space S n , the spaces S k and S n _ k have just one point in common. 
We denote this by E io _ ik . The points E u „, ik , A ik+l , ...,A in are 
hnearly independent. 

If E ioik lay in any of the faces of the simplex A io , ...,A ik of S k , 
say in that face determined by A iv ..., A ik , S n _ k would he in the 
space determined by A h , ...,A ik , A ik+1 , ..., A in , that is, in 27*°. 
Hence 27*° would contain E, and this contradiction proves that 
E io i k does not he in any of the ( k — l)-spaces determined by the 
points A io , ...,^._ 1 , Ai. +1 , ...,A ik . 

We obtain in this way 2 n+1 — 1 points E ioi ik , including E t = A { 
and E 0 n = E. It should be noted that any incidence space of n 
dimensions contains such a set of points, at least, and that they 
are ah distinct. 

Let us now consider the space S k determined by A io , ...,A ik . In 
this space let us use the point E io - ik , which does not he in any of 
the faces of the simplex A io ...,A ik , to construct points E' jo ja , 
where j 0 , is a subset of i 0 ,...,i k . Arrange i 0 , ... ,i k so that 
(j 0 , • • • ,j 8 ) = (^o> Then E' jo Ja is the intersection of the space de- 
termined by A io ,...,A is with that determined by E ioh , A is+1 , ...,A ik . 
But this last space is the intersection of the space determined by 
A io , ...,A ik , or S k , and the space determined by E, A w ..., A in . 
It follows that E' jo jt = E ja so that no new points are obtained 

in f.Viisi wav. 
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We shall always assume that n > 1 in subsequent sections, the 
case n = 1 being of little interest. Indeed all that we can deduce 
from the propositions of incidence when n = 1 is that there are at 
least three points, and that two points are linearly independent if 
and only if they are distinct. 

2. Desargues’ Theorem. This theorem is fundamental for 
the development of this chapter. Since the proof of both Theorem 
and converse given in V, § 6 was based only on the propositions of 
incidence, except in the case n = 2, we may state the theorems 
without further proof for the case n > 2. 

Theorem I. If ABC and A'B'C' are two triangles in perspective 
there exists a line l passing through a point common to BC and B'C', 
a point common to CA and C' A' and a point common to A B and A' B' . 

Theorem II. If ABC and A'B'C' are two triangles such that there 
is a line l passing through a point common to BC and B'C ' , a point 
common to CA and C' A' and a point common to A B and A' B', the 
triangles are in perspective. 

We shall refer to these theorems as Desargues’ Theorem and its 
converse. 

The theorems were only proved from the propositions of in- 
cidence in the case n = 2 for a number of special cases. The con- 
nection between the algebraic proof given in V, § 6 and the pro- 
positions of incidence is necessarily not very clear at present. But 
we have no a priori reasons for thinking that the case n = 2 is not 
a consequence of the propositions of incidence. However, we now 
show, by means of a counter-example, that when n = 2 the theorems 
cannot be deduced from the propositions of incidence. In fact, we 
construct an aggregate of points and lines which satisfy the pro- 
positions of incidence in the case n = 2, but for which Desargues’ 
Theorem is not true. This construction is due to F. R. Moulton. 

As a preliminary, we consider a Euclidean plane referred to 
rectangular Cartesian coordinates ( x , y), and in it a certain set of 
loci, which we call L-loci. An L-locus is defined by the equation 

V = mf(y,m)(x-a), 

where m, a are real numbers, and f(y, m) is defined as follows: 

(i) ifm<0, f{y,m)= 1; 

(ii) ifm>0, f{y,m)=\, whenycO, 

f(y, m) = when y > 0. 
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We also include the case m = oo, the line x = a being an L-locus, 
and m = 0, a = oo, the line y = b being an L-locus. 

It is clear that an L-locus is completely determined by that part 
of it which lies in the region 0, or by that part which lies in y < 0. 
Moreover, if an L-locus has ra > 0 (and is therefore not a line), the 
line joining any two points of it has positive slope. Bearing these 
facts in mind, it is easily seen that there is an L-locus joining any two 
points A, B, of the plane, and that it is uniquely determined by 
these points. In fact, if A and B both lie in y > 0 the upper half of 
the L-locus is the upper half-line through A and B; and this deter- 
mines the L-locus uniquely. A similar result holds if A and B both 
lie in y < 0. Suppose now that A lies in the region y > 0, and B in 
y< 0. If the line AB has negative slope, the unique L-locus con- 
taining A and B is the line AB. If, finally, AB has positive slope, 
let A be (x 1 ,y 1 ), and let A' be the point (x v 2y x ). Let A'B meet 
y = 0 in M. Then the L-locus containing A and B is necessarily 
that determined by the half-line MA. 

We show now that any two distinct L-loci 

y = mf(y,m)(x-a), y = nf{y,n)(x-b) 

(m + w) have a unique point in common. This is evident if both 
m, n are negative, or, by the above remarks, if one of m, n is negative. 
If both ra, n are positive, and we solve the equations 

y = m(x — a), y = n{x-b), 

we find that y = mn(b — a)j(n — m); 

and solving the equations 

y' = \m{x’ -a), y' = \n(x' -b), 

we find that y' — \mn(b — a)l{n — m). 

Thus y gives the intersection of the L-loci if ( b — a)j(n — m ) is nega- 
tive, and y' if this ratio is positive. This proves the result, and we 
deduce again that one and only one L-locus joins any two points. 

If m = n the L-loci do not meet, unless they coincide. They are 
then said to be parallel. 

The process by which we pass from a Euclidean plane to a pro- 
jective plane by the introduction of elements at infinity is well 
known. In this projective plane Desargues’ Theorem is, of course, 
true. We can now follow a similar process to arrive at a set of 
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elements for which our postulates (with n - 2) are satisfied, but 
Desargues’ Theorem is not true. To distinguish the points and lines 
of this space from those of the original Euclidean plane, we shall 
call them Points and Lines. The points of the Euclidean plane are 
Points. Corresponding to each set of parallel L-loci we postulate a 
further Point (at infinity) which is regarded as a Point of each locus 
of the set. The Lines are the L-loci of the Euclidean plane together 
with one other element, the Line at infinity, which contains all the 
Points at infinity. It is now a simple exercise to verify that the 
Plane which contains these Points and Lines, and the Points and 
Lines themselves satisfy the Propositions of Incidence of p. 208. 

In this Plane we now construct a configuration which violates 
Desargues’ Theorem. The construction involves certain points and 
lines in the Euclidean plane which are treated as Points and Lines. 
It is important to observe the difference between points and lines 
on the one hand, and Points and Lines on the other. The justification 
of the assertions concerning the elementary geometry involved may 
be left to the reader. 



Fig. 6. 


We construct two triads ABC, A'B'C' for which the lines AA', 
BB ' , CC are all parallel to O Y , C and C' lying on 0 Y , for simplicity, 
A on OX, and the intersections of corresponding sides lying on a 
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line TJVW parallel to OX in the upper half of the plane. We also 
suppose that the slopes of BC, B'C', CA', CA are negative, while 
those of AB, A'B' are positive. Then the Lines AA', BB', CC' are 
parallel, and therefore concurrent, the Lines BO, B'C' intersect 
in U, the Lines CA, C'A' in V. The Line AB is the same as the 
half-line AB (since A is on OX), and passes through W, whereas if 
A' is on the opposite side of OX to B', as in the diagram, the line 
A'B' passes through W, and therefore the Line A'B' cannot pass 
through W. Hence ABC, A' B'C are two triads in perspective 
whose corresponding sides do not intersect in the points of a Line, 
and Desargues’ Theorem does not follow from the Propositions of 
Incidence when n = 2. An even simpler example than the above 
shows (as we should expect) that the converse of Desargues’ 
Theorem is also not necessarily true in an incidence S 2 . 

Geometries in which Desargues’ Theorem is not true have been 
the subject of numerous researches, but since they cannot be identi- 
fied with the geometry of Chapter V we are not concerned with 
them here. We add to our postulates 

Villa. If n = 2, Desargues’ Theorem is true. 

We only need to postulate Theorem I of the present section, and 
not the converse, Theorem II, since, as we saw in V, § 6, the latter 
can be deduced from the former. 

3. Related ranges. A set of points P, Q, R, ... on a line l is 
called a range of points on l. A set of coplanar lines p,q,r, ... through 
a point L is called a pencil of lines through L. From the Propositions 
of Incidence it follows that if 0 is any point not on l, and l' is any 
line in the plane joining 0 and l which does not pass through 0, 
l' will intersect any line of the pencil 0(P,Q, R, ...) in a point, 
giving a range of points P' , Q', R', ... on V . The ranges on l and l' 
are said to be in perspective from 0, and we write 

(P,Q,R,...)°(P',Q',R',...). 

Such a perspectivity establishes a one-to-one correspondence 
between the points on l and those on l' . 

If l contains only k points, l' contains only k points, and every line 
in the plane contains exactly k points. Through O there pass exactly 
k lines, and through every point in the plane there pass exactly k 
lines lying in the plane. By Postulate III, k > 3. We shall see that 
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there are geometries in which a plane contains only a finite number 
of points, and our methods must be devised to take account of these 
finite geometries, especially when k = 3. The Propositions of In- 
cidence assure us of the existence of 7 points in a plane in this case, 
and there are finite geometries in which a plane contains exactly 
7 points [V, § 9]. 

Returning to our perspectivities, suppose that another per- 
spectivity has been established between the points of l' and those of 
a line l", the vertex being O'. The line V must cut l', but need not 
lie in the plane joining 0 to l, which we write 0(1). We describe the 
perspectivities thus: 

(P, Q, R, . . . ) £ (P', Q', R', . . . ) & (P", Q ", R ", . . .). 

A one-to-one correspondence, which is not necessarily a perspec- 
tivity, is set up between the ranges on l and l", and we say that these 
ranges are projectively, or homographically, related , or simply that 
they are related ranges. We write 

7{(P\Q",R",...) 

in this case. 

The process may be continued, and we say that two ranges are 
related if they can be put into a one-to-one correspondence by means 
of a finite chain of perspectivities. It is clear that in a perspectivity 
distinct points are projected into distinct points. Hence, if two 
ranges are related, distinct points in one range correspond to dis- 
tinct points in the other range. 

An important problem in the theory of related ranges is to 
determine when two given ranges, which are in a one-to-one corre- 
spondence, are related by a series of perspectivities. The following 
theorem is a first step towards an answer. We shall use the term 
projectivity for series of perspectivities. 

Theorem I. Given two triplets of distinct points P x , Q v R 1 and 
P 3 , Q 3 , R 3 lying, respectively, on two distinct lines l l and l 3 , there is a 
projectivity which assigns P v Q lt R x to P» Q 3 , R 3 respectively. 

Since P v Q v R l are distinct, and is distinct from l 3 , at most one 
of these points lies on l 3 . Similarly, at most one of the points Pa> Qa> 
R 3 lies on l v Hence we can always choose one pair of points which 
are not required to correspond in the projectivity, say P 1 and Q z , 
neither of which lies on both and l 3 . Let P 1 Q 3 be the line l a . 
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The join Q X Q 3 is distinct from l x and l 2 . On it take a point O x 
which is not on either of these lines, and let 0 1 R 1 meet l 2 in R 2 
(O x and R x lie in the plane O x (l 2 )). Since l 3 and l 2 meet, P X P 3 and 
R 2 R 3 meet. Indeed, their intersection is unique, since P 3 4= R 3 , and 
P X P 3 has a single point in common with l 3 . Let this intersection be 
O a . (It is not necessarily distinct from O v ) Then 



Fig. 0. 

An immediate corollary is that a projectivity which is the result 
of three perspectivities, at most, can be set up between two ranges 
on the same line, so that three distinct, arbitrarily chosen points 
correspond to three distinct, arbitrarily chosen points. 

It may be thought that the projectivity established between l x 
and l 3 is a special one, because it is the result of only two perspec- 
tivities. This is not the case, for we now prove 

Theorem II. If a range of points on a line l x is related by a chain 
of m perspectivities to a range of points on a line l m+1 4= l v this pro- 
jectivity is equivalent to at most two perspectivities. 

Two preliminary theorems are necessary. Consider a sequence 
of two perspectivities between the lines l x and l 2 , and l 2 and Z 8 . We 
shall call l 2 the intermediary line for the projectivity between l x 
and l 3 . l x and l 2 must intersect: let them meet in the point L 12 , 
and let l 2 , l 3 intersect in the point L 23 ; the points L l2 and L 23 may 
or may not coinoide. If they do, we have 
Theorem III. If l v l 2 , l 3 are concurrent and l x 4= l 3 the sequence 
of two perspectivities between the points of l x and l 3 is equivalent to a 
single perspectivity. 
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Let (P 1> Q 1 ,R 1 ,...)°^(P 2 ,Q i> R 2 ,...), 

and (P 2i Q 2 ,R 2 , ...) j? 3 (P 3 ,Q 3 ,R 3 , ...). 

There is nothing to prove if 0 12 = 0 23 , so we may suppose that 
0 12 4= We consider the triads Q 1 Q 2 Q 3 . and we see that 



these are in perspective from L X2 = Z* 23 . If both triads are proper 
triangles we are necessarily in the general case of Desargues’ Theorem, 
and the sides P 1 P 3 ,Q 1 Q a intersect on the join 0 12 0 23 of the two centres 
of perspective at a point 0, say. Thus Q X Q 3 , and similarly R X R 3 , 
passes through 0, the intersection of 0 12 023 with P X P 3 . Thus 

(Pl> Ql> -®1 •••) ^ (p3>Q3>^3 •••)• 

If the points Pv P& P 3 are colhnear, which is the case when P x lies 
on 0 12 0 23 , the point 0 may be taken at the intersection of Q X Q 3 
with 0 12 0 23 = P 1 P 2 P 3 . If both sets of points P x , P 3 , P 3 and Q x , Q 3 , Q 3 
are collinear then we must have 0 12 = 0 23 . 

If, on the other hand, L X2 4 = L 23 , we have 

Theorem IV. In the sequence of two perspectivities between the 
points of l x and l 3> the intermediary line may be replaced by any other 
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line which joins a pair of points on Z x and l 3 which are neither 
corresponding nor coincident. 



We apply the previous theorem twice. Let l* be the line joining 
X x of Z x to 7 3 of Z 3 . By hypothesis, X x and Y 3 are distinct, and do 
not correspond in the projectivity between Z x and l 3 . Let us suppose, 
in the first place, that X x does not correspond to L 23 , the inter- 
section of l 2 and l 3 . Since L 23 is self-corresponding in the perspec- 
tivity , centre 0 23 , between l 2 and l 3 , we deduce that X x L 23 does not 
pass through 0 X2 . We take the line X 1 L 23 as l 2 . It satisfies the 
requirements of our theorem, since X x and L 23 are not corre- 
sponding points. 

We can project the range (P 2 , . . . ) on l 2 from 0 X2 into the range 
(P 2 , ...) onZ 2 . For 1' 2 lies in the plane L 23 (l x ) = 0 12 (i x ). By a sequence 
of two perspectivities, vertices 0 12 and 0 23 , we have set up a pro- 
jectivity between the ranges (P 2 , . . . ) on l 2 and (P 3 , . . .) on l 3 , and the 
lines, Z 2 , l 2 , l 3 are concurrent. By the previous theorem there is a 
point 0 23 on 0 X2 0 23 such that 

Similarly, since X x and F 3 are not corresponding points, and X x is 
self-corresponding in the perspectivity, centre 0 X2 , between Z x and 
l 2 , the line X x F 3 = l 2 does not pass through 0' 23 . If we now project 
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the range (P 2 , ...) on l 2 into the range (P* , •••) on if from the point 
0 23 , then again, by the previous theorem, since if, l 2 , l x are concurrent, 
there exists a point 0' 12 on 0 12 0 23 and therefore on 0 12 0 23 , such that 


The line if therefore satisfies the requirements of our theorem, and 
may be used instead of i 2 as an intermediary line. 

If X x corresponds to L 23 , the argument fails. If L 12 does not 
correspond to Y a we can prove the theorem, as above, by taking 
L X2 Y 3 in place of i 2 as the first intermediary line used in our argu- 
ment. There remains the case in which X 1 corresponds to L 23 , and 
L 12 corresponds to Y a . We consider this case. 

On ij there exists at least one other point besides L 12 and X v Let 
T x be such a point. If i 3 meets l v and T x is the point of intersection, 
the point T a of l a which corresponds to T x is different from T v unless 
the join 0 12 0 23 of the centres of perspective passes through T v 
Suppose that 0 X2 0^ passes through T x , and that T x = T a . There are 
two possibilities: 

(i) There is a point Z a on l a distinct from T a , L 2a , F 3 . We can pass 
from the intermediary line l 2 to the intermediary line l* by taking 
as intermediary lines, in order 


L x o L 


12 " L/ 23> 


Lia Yi a , 


Z^Xi, X x Y a , 


none of which joins corresponding points, or passes through the 
intersection of l x and l a . 

(ii) Every line contains only three points. Then the ranges on 
l x and l a are in perspective from the intersection of X x L 2a and 
Y a L n , and this point is 0 12 = 0 23 . The case in which O l% — 0 23 is, 
of course, trivial. 

Finally, if l x and l 3 meet, and T a and T x are different, or if l x and 
l 3 do not meet, and T a corresponds to T v we replace Z 3 by T 3 in the 
proof of (i), above. This concludes the proof of Theorem IV. 

We are now in a position to complete the proof of Theorem II. 
In order to do this we first consider the case of four lines l v l 2 , l 3 , l v 
with ranges (P^ ...), (P 2 , ...), (P 3 , ...), (P 4 , ...) on them, and suppose 
that 


(P V ...) 0 £(P»:.). 


(P l ,...) 0 A n (P„->, 


(P S ,...) 0 «(P 4 ,...). 
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By the definition of perspective, Z x and Z 2 are distinct intersecting 
lines, and similarly for Z 2 and Z 3 , and for Z 3 and Z 4 ; but other coinci- 
dences of the lines l v l 2 , Z 3 , Z 4 consistent with these restrictions are 
permissible, and must be taken into account. 



Fig. 9. 

We first prove that if l x and Z 4 are distinct lines we can pass from 
the range (P 4 , ...) to the range (P 4 , ...) by two perspectivities. We 
first observe that in the case in which each line of the configuration 
contains only three points the result follows by the argument used 
to prove Theorem I. Indeed, the ranges on Z x and Z 4 each consist of 
three points and are in one-to-one correspondence. Using the 
argument adopted to prove Theorem I it follows at once that we can 
pass from the range on Z 4 to the range on Z 4 by two perspectivities. 

In proving the result generally we may now confine ourselves to 
the case in which each fine of the configuration contains at least 
four distinct points. We first consider the cases which arise 
when the lines Z l5 l 2 , Z 3 , Z 4 have certain special mutual relationships. 

(i) l v Z 2 , l 3 distinct but concurrent. By Theorem III, we can pass 
from (P x , ...) to (P 3 , ...) by one perspectivity, and the result follows 
at once. 

(ii) l 2 , l 3 , Z 4 distinct but concurrent. This case follows in exactly 
the same way as (i). 

(iii) l x = l 3 , Z 4 not through L n = L 23 . Then there is a line l* 
through L 12 , different from l 3 , which meets Z 4 in a point different 
from a possible intersection of Z 2 and Z 4 and from the point of Z 4 
which corresponds to L 12 on Z 2 . By Theorem IV we may replace 
h ^7 Z 3 and then apply (ii). 

(iv) Z 2 = Z 4 , Z x not through L 23 = L u . This case follows as in (iii). 

(v) Z x = Z 3 , Z 2 = Z 4 . Let m be any line through L u = L 23 = L 3i 
different from Z x = Z 3 and from Z 2 = Z 4 . From a point V x in the plane 
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of l x and m we project the range (P v ...) on l x into the range (P, ...) 
on m. Then 

ip,...) 5* (p„ ...) °“(p„ ...) °“(p„ ...)° a m (p.,...). 

Using Theorem III, we obtain, in turn, 

(P,...)^(i$,...) (*=1,2, 3, 4) 
for suitably chosen points V 3 , V x . Hence we have 

and indeed, by another application of Theorem III, we see that 
the ranges on l x and Z 4 are in perspective in this case. 

The cases considered above deal with all the cases in which l v l 2 , l 3 
are concurrent, or Z 2 , l 3 , Z 4 are concurrent. The remaining cases are 
first, the general case illustrated in Fig. 9, and the cases in which 
l v l 2 , Z 4 are concurrent (but all different) or l v l 3 , Z 4 are concurrent (but 
all different). These last possibilities cannot occur simultaneously, 
otherwise we should be in a case already considered. We need only 
consider the case in which l v l 2 , Z 4 are not concurrent (whether 
l v l 3 , l x are concurrent or not), since the case in which l v l 2 , l t are 
concurrent and l v l 3 , l x are not concurrent is obtained by inter- 
changing the roles of l 2 and l 3 . Our argument, of course, includes 
the general case. 

Since l v l 2 , l x are not concurrent we may replace l 3 by a line l* 
joining L 12 to a point of / 4 not on l 2 , nor corresponding, in the range 
on l v to Lj 2 regarded as a point of l 2 . We then have l v l 2 , l 3 con- 
current, and may apply (i). This completes our proof of Theorem II 
in the case m = 3. 

Now consider m perspectivities, that is, the general case of 

Theorem II. The case in which each line has only three points can 

be dealt with as when m — 3, and may be omitted. If, in the sequence 

oflines , r , , , ox 

Zj., Z 2 , l 3 , ..., l m+ i (w* > 2), 

we can find a line which is different from l {+3 we can apply the case 
m = 3 to reduce the number of perspectivities. We proceed in this 
way until either we reach the case of two perspectivities, or else 

k = h + 3 (* = 1 m — 2). 
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In this case m > 3 since we have assumed l x =4= l m+1 . Then l v l 2 , Z 8 are 
distinct lines. We can, by Theorem IV, replace l 2 by a different line 
1$ without altering any other Then, since 1$ 4= l 5 we can apply 
the result for m = 3 to shorten the sequence of perspectivities. Thus 
in all cases we can reduce the number of perspectivities to two and 
our theorem is proved. 

Corollary. A one-to-one projective correspondence between two 
ranges on the same straight line can be obtained as the result of 
at most three perspectivities. 

4. Harmonic conjugates. On a line l take two distinct points 
A and B. If C is any point of l we construct a new point D on l 



In a plane n through l take three lines a, b, c passing respectively 
through A, B, C, each distinct from l and not all meeting in one 
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point. Since at least three distinct lines pass through every point in 
n, it is possible to choose a, b, c to satisfy these conditions. Let b and 
c meet in P, c and a in Q, and let a meet b in the point R. Then A 
cannot coincide with P, nor B with Q. Since A, B, P, Q are coplanar 
points, and not collinear, AP and BQ meet in a point. Let this be 8. 
Then S must be distinct from R, since B is distinct from A . The line 
R8 has a unique intersection D with l. This is the point we seek. 

It is immediately verified that if C = B then D = B, and if 
C — A, then D = A. In these cases the construction of D does not 
depend on the particular set of lines a, b, c chosen. We prove now 
that if C is distinct from A and B the point D is independent of the 
coplanar set of lines a, b, c chosen: in other words, D is determined 
only by the ordered triad A, B, C. 

We first examine the case in which there are only three points 
A, B, C on l. Then D must coincide with one of these points. If 
D — A, then R, 8, Q, A must be collinear, and hence Q = S or Q = A. 
If Q — 8, then R = P, and a, b,c are concurrent, contrary to hypo- 
thesis; if Q — A, then G — A, which is again contrary to hypothesis. 
Similarly we cannot have D = B, and so the only remaining possi- 
bility is D = C. Hence in this case too, the position of D does not 
depend on the particular choice of A, B, C. 

We may now confine ourselves to the case in which l (and hence 
every line) contains at least four points, and through any point 
there pass at least four lines which lie in any given plane through 
the point. We consider any plane n' (not necessarily distinct from n) 
through l, and in it we draw three lines o', b', c' through A, B, C 
respectively, these fines being distinct from l and not concurrent. 
We repeat the construction given above, denoting the points 
corresponding to P, Q, R, S by P', Q', R', S'. Let R’ S' meet 
l in D'. 

Consider, first of all, the case in which the fines a, o'; 6, b'; c, c' are 
all distinct. Corresponding sides of the triangles PQR, P'Q'R' 
intersect inA, B,C, which are collinear. Hence, by the converse of 
Desargues’ Theorem, these triangles are in perspective. If P = P', 
then b = b', and c = c', contrary to hypothesis; for similar reasons 
Q4=Q', R±R'. Thus the fines PP', QQ', RR' are determinate, and 
they have a point 0 in common. If two of these fines coincide, say 
QQ' = RR' , then QR = Q'R', that is, a = a', which is contrary to 
hypothesis. Therefore O is determined by any two of the three 
lines. 


HP I 


15 
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Again, corresponding sides of the triangles PQS, P'Q'S' intersect 
in B, A, C, and therefore these triangles are in perspective. Hence 
S' and O, the intersection of PP’ and QQ' , are collinear. The 
triangles QRS, Q'R'S 1 are therefore in perspective from 0. Let us 
assume, first of all, that SQ=t S'Q'. Since SQ meets S'Q' in B, and 
QR. meets Q'R' in A, Desargues’ Theorem shows that RS and 
R’S' have a point in common on BA = l. Since RS meets l only in 
D, and R' S' meets it only in D' it follows that D — D'. 

If SQ = S'Q', but SP #= S'P' we may use the triangles PRS, 
P’ R' S' in place of QR S, Q'R'S' and prove thatD = D' . USQ = S'Q' 
and SP = S'P' then S = S' and QQ', PP' meet in S. Since PQR, 
P'Q'R' are in perspective, RR' goes through S = S'. Hence 
RS — R'S' meets l in the same point D. 

This proof may break down if the lines a, a’ \ b, b and c, c' are not 
all distinct. For instance, if c = c', the triangles PQR, P'Q'R' are 
in perspective from the intersection of RR' with c, and PQS, 
P'Q'S' are in perspective from the intersection of SS' with c, 
but since PP' = QQ' we cannot deduce from these facts that 
QRS, Q'R’S' are in perspective. We must therefore modify our 
proof. 

We note that if c = c' the planes n, n' coincide. We show that if 
the lines a, a'; b, b'\ c, c' are not all distinct there exists at least one 
other set of lines a *, b*, c* through A, B, C satisfying the require- 
ments of the construction, such that a, a*; b, b*\ c, c*, and a', a*; 
b', b*; c’, c* are distinct sets of fines. The case in which c = c' is 
typical. Since through each point of l there pass at least four 
distinct lines of n we can find lines a* and b* through A and B 
respectively distinct from l, a, a’ and l, b, b' respectively. Let b* 
meet a* in R*. Since c = c', there is at least one fine c* in n which 
passes through C and is distinct from c — c' , l and CR*. The fines 
a*, b*, c* satisfy our requirements. We use them to construct a 
point D*. By the proof given above D = D* and similarly D' = D*. 
It follows once more that D = D'. 

This unique point D is called the harmonic conjugate of C with 
respect to A and B, and is denoted by 

D = (A,B)/C. 

It is clear from the construction that 


D = (B,A)/G. 
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We now prove that 

C = (A,B)/D = (B,A)/D. 

To construct the harmonic conjugate of D with respect to A and B 
we draw three lines AQR, B8Q, DSR through A , B, D respectively 
to form a triangle QSR. Then BR and AS intersect in P, and QP 
meets l in the harmonic conjugate of D with respect to A and B 
(or B and A). This is the point C, so that the result is proved. 

We saw in § 3, Th. I, that a projectivity can be established between 
the points A, B, C, and the points A', B', C , these being two 
triplets of distinct collinear points. We now prove that such a 
projectivity makes ( A , B)/C correspond to {A' , B')jC' . 

If in fact G = A or C = B then we must have C = A' or C' = B', 
and in these cases the result is evident. We therefore suppose that 
A, B, C (and therefore A', B', C') are distinct. Since a projectivity 
is the result of a finite number of perspectivities, we need only 
consider ranges A, B, C, A', B', C' on distinct lines l, l', which are 
in perspective. 



(i) A = A'. Let the ranges be in perspective from R. To construct 
D = (A,B)/G we draw the lines AR, BB', GB' through A, B, C, 
respectively, to form the triangle B'QR. Since AB' , BQ meet in S, 
RS meets l in the required point D. To construct D' — ( A B')IC', 
we draw through A', B' , C' the lines A'QR, B'QG and C'GR 
respectively to form the triangle CRQ. Since A'C and B'R meet 
in B, BQ meets l' in D' . Therefore D' — S. Hence (A, B, G,D) and 
(A',B\ C',D') are in perspective from R. 


IV* 
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(ii) B — B'. This case follows at once from (i) since 
(A,B)/C = (B,A)/C. 



(iii) C = C'. If AB' and A'B meet in S, and RS meets l in D, 
V in D', we construct (A,B)/C by means of the lines ^4^4', BB', 
CA'B', and ( A',B')jC' by means of the lines A' A, B'B, CAB. It 
follows at once that D = (A, B)/C, and D' = (A', B')/C'. Hence 

(A,B,C ,D)*(A',B',C\D'). 

(iv) Let l and V meet in a point distinct from A, B, C, A', B', C', 
and suppose that 

(A,B,C,D)° (A',B',C',D'), 

where D — {A, B)/C. To prove that D' = (A', B')/C', join CA' and 
let this line meet ODD' in D v and OBB' in B v Since 

(A,B,C,D)°(A',B V C,D 1 ), 

and D = ( A,B)/C , it follows from (iii) that D x = {A', B^/C, and 
since 

(A', B v C, Dj) ^ (A’, B', C’, D'), 
it follows from (i) that D’ = (A', B')jC. 
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These results are usually stated in the following form: 

Theorem I. Harmonic conjugacy is protectively invariant. 

An important corollary is 

Theorem II. If there exist three distinct points A, B, C such that 
G = {A, B)jC, then, given any three points A', B', C' , with A' 4= B', 
we have C' = (A' , B')jC' . 

This is evident if C’ — A’ or C' — B'. If C’ is distinct from A' 
and B', there is a projectivity which takes A', B', C' into A, B, C. 
By Theorem I this takes (A', B')jC' into ( A,B)IC . Since, by hypo- 
thesis, this last point coincides with C, we must have 

O' = (. A',B')/C' . 

When this happens we shall say that we are in the special case. 

For the purpose of completing the relationship between the pairs 
A , B and C, D, we find it convenient to define (A, B)jG when A = B. 
The definition varies with the circumstances. If we are in the 
special case we allow any point of the line to be D = (A,A)/C. If 
we are not in the special case we define (A, A) 1C to be A when 
(7 + A, and to be any point of the line when C = A. 

We now prove that if D = ( A , B)/C, then A — ( C,D)/B . It may 
be seen at once that this follows from the definition of harmonic 
conjugacy, with the above convention, if the points A, B, C,D are 
not all distinct. We need only consider the case, therefore, in which 
A, B,G,D are four distinct points on l. 
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With the notation of Fig. 10 let PD meet RA in Q', and QD meet 
RB in P' . The lines RD, AP, BQ pass through S, so that the sides 
of the triangles RAB, DPQ must intersect in collinear points. 
Therefore P'Q' passes through C. 



To construct ( C,D)jB we draw the lines DP', CP and BPP' 
through D, C and B, forming the triangle PP'Q. The lines DP and 
CP' meet in Q', and QQ’ meets l in the required point, which is 
therefore A. To sum up, we have proved 

Theorem III. The harmonic relation is symmetric in A, B and in 
C, D and also in the pairs (A, B) and ( C,D ). 

We may now describe (A, B) and (C, D) as harmonically conjugate 
pairs. The harmonic property is not of much interest in the special 
case, but it is of fundamental importance in all other cases. 

If we are not in the special case we cannot extend § 3, Th. I and 
find a projectivity which relates four distinct points A, B, C, D of 
one range to four distinct points A', B', C' , D’ of another. For if 
D= ( A,B)/C , but D'^(A', B')/C', the existence of such a pro- 
jectivity would contradict Theorem I. 

Finally, if (A,B), ( C,D ) are harmonically conjugate, and 0 is 
any point not on the line ABCD, consider the lines a = OA, b = OB, 
c = OC, d = OD. If V is any line in the plane of these lines which 
does not pass through 0, and cuts these lines in A’, B', C', D' 

respectively, then n 

{A,B,C,D)^{A',B',C',D'), 
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and so {A' , B') are harmonically conjugate with respect to ( C',D '). 
We express this property of the pencil of lines a, b, c, d by saying 
that (a, b) are harmonically conjugate with respect to (c,d). 

5. Two projectively invariant constructions. We re-examine 
the constructions already given in V, §7, and prove that they are 
projectively invariant. Their importance lies in the fact that they 
can be used to construct a field by which we shall introduce co- 
ordinates into the incidence space [§§ 7, 8]. 

On a line l take three distinct points 0, E, U. Let A, B be two 
points of l, each distinct from U. 

Construction I. In a plane n through l draw three lines a, b, u 
through A, B,U, respectively. These lines are to be distinct from, l and 
must not all pass through one point. Let P be the intersection of u and a, 
Q the intersection of u and b, R the intersection of OQ and a, S the 
intersection of UR and b. Then PS intersects l in a definite point T. 

After justifying this construction we shall show that T depends 
on O, U, A, B only, and not on the construction lines a, b, u. The 
first part of the construction, the selection of the three lines a, b, u, 
is possible because, as we have already seen [§ 3], at least three 
distinct lines pass through every point in n. The point Q cannot 
coincide with O, and so the line OQ is defined. The point R is unique, 
since even when A — O, P^Q. Since R is on a, R 4= U, and therefore 
UR is determined. The points P and S are distinct, and their join 
is distinct from l, since u does not go through the intersection of a 
and 6, and does not coincide with l. Hence PS meets l in a definite 
point T, and since PS is distinct from u, T=f= U. 

The ‘general’ case is illustrated by Plate III, (i). We first 
show, however, that T is independent of the construction lines 
a, b, u in the following special cases, in which the points 0, A, B 
are not all distinct (the numbering refers to the diagrams in 
Plate III): 

(ii) A = 2? + O. We see that T = (A,U)jO, performing the usual 
construction for the harmonic con j ugate , using only lines of the figure . 

(iii) 0 = A. Evidently T = B. 

(iv) 0 — B. Here T = A. 

(v) A = B = 0. We see that T = 0. 

If in a projectivity the points 0, A, B, U become the points 
O', A', B ' , U' , the point T becomes the point T' in the above special 
cases, since coincident points become coincident points, and 
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harmonic conjugates become harmonic conjugates. In these cases, 
then, the constructions are projectively invariant. 

We now suppose that 0, A, B, U are distinct. We prove that 
P, Q, R, S are distinct points, no three of which are collinear. By 
construction, P and Q are distinct points. Also OQ^b, and Q does 
not lie on a. Hence R is not on b, and, in particular, R + Q. If R = P 
then we should have QR = QP; but QR meets l in O, and QP meets 
l in U. Since 0+ £7, it follows that P+P. Again, since 17+ A, 
UP UR, and since U^B the points Q, S are distinct. Finally, 
R is not on 6, and therefore S 4= R- We conclude that P, Q, R, S are 
distinct points, no three of which are collinear. 

If R were on l, R would be A, and OQ would be l; that is, Q would 
lie on l, contrary to our construction. Similarly, if S were on l, S 
would be B, and R would lie on l, which we have seen is not the case. 
Hence none of the points P, Q, R, S lies on l. These facts will help 
to clarify certain points in subsequent proofs. 

We must show that if n' is any plane through l (which may 
coincide with n) and o', b’ , u’ are lines of n’ through A, B,U respec- 
tively, all distinct from l and not concurrent, then the construction 
given above, with a, b, u replaced by o', b' , u' respectively, leads to 
a point T' which coincides with T. 

Case (i). n^n'. By the converse of Desargues’ Theorem the 
triangles PQR, P'Q'R' are in perspective, corresponding sides 
intersecting at the collinear points 0, A, U. Similarly, the triangles 
SQR, S'Q'R' are in perspective. Hence there is a point common to 
the lines PP', QQ', RR’, SS', so that the triangles SPQ, S'P'Q' 
are in perspective. Hence SP meets S'P' on the line UB; that is, 
T = T. 

Case (ii). n = n'. If the configuration lies in a space of dimension 
n > 2, then the uniqueness of the point T is at once established by a 
double application of (i); the two constructions in n = n' are com- 
pared with a construction in a plane through l distinct from n = n r . 
But if n = 2, we must proceed differently. We first establish the 
uniqueness of T for the cases, three in number, in which two and 
only two of the coincidences a — a', b = b', u = u' occur. Having 
done this, we observe that there are at least three fines in n distinct 
from l through each of the points A, B, U, for l has been assumed to 
contain at least four distinct points 0, A, B, U. The reader may now 
easily verify that we can pass from construction fines a, b, u to 
construction fines o', 6', u' by a series of steps in each of which only 
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one of these lines is altered. Thus we have only to consider the three 
following special cases (iia), (ii6), (iic). 

(iia). a = a', b — b', u^u'. Let PS meet QR in Y. Clearly XT is 
the harmonic conjugate of XU with respect to the distinct lines 
XA, XB. Similar^, XY' is the harmonic conjugate of XU with 
respect to XA, XB. Hence X, Y, Y' are collinear points. The 



triangles PQY, P'Q'Y' are therefore in perspective from X. Since 
PQ meets P'Q' only in U, and QY meets Q'Y' only in 0, it follows 
from Desargues’ Theorem that PY meets P'Y' in a point of OU ; 
that is, T = T\ 

(iib). a = a', b 4= b', u = u'. Consider the triangles QRS, Q'R'S'. 
Corresponding sides meet in the points U, B, 0 respectively. Hence 



the triangles are in perspective. But QQ' meets RR' in the one 
point P. Therefore P, S, S' are collinear, and hence T = T' . 
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(iic). o+a', b = b', u = u'. The triangles PRS, P'R’S' are in 
perspective from Q. Since PR meets P'R' only at A, and RS meets 



R'S' only at U (if 8 = S', R = R', since O^U, and then a = a'), 
it follows from Desargues’ Theorem that PS meets P’S' on A U. 
That is, T = T\ 

The three special cases having been examined, the uniqueness of 
T is now completely established. 

We now proceed to establish the projective invariance of the point 
T; that is, we shall prove that if there is a projective correspondence 
between l and V such that 

(A,B, U,0,...)A(A',B', V\ O',...), 

and if T' is the point obtained by applying Construction I to 
A', B', U', O', then 

(. A,B , U, 0, T, ...) A (A',B' y V', O', T', ...). 

We need only prove the invariance of the construction in question 
under a perspectivity. 

If n ^ 3, and the ranges on l and l' are in perspective from V, there 
are planes n, n' through l, l' respectively which do not contain V. 
We perform Construction I on l in n, and project the configuration 
in 7 T from V on to n'. The projected figure gives Construction I on 
l' , and therefore the projective invariance of T is proved. 

If n = 2 we must use other arguments. As we have already noted, 
we need only consider the case in which A, B, 0, U are all distinct. 
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We suppose that A, B, 0, U and A', B', O', U’ are two ranges on 
lines l, V respectively, which are in perspective from V. 

Case (i). 0 = O’. To construct T we use the lines AV, BV, UB’ 
for a, b, u. Then, in the notation of the figure, PQ meets l in T. To 
construct T’ we use the lines AV, BV, V'B through A', B’, U' 
respectively. Then SR meets l’ in T'. Let PB'U, SBU' meet in X. 
Evidently OX is the harmonic conjugate of OF with respect to l 
and V. Similarly, if A'U meets A U' in Y, OY is the harmonic con- 



jugate of OF with respect to l and V. Hence, 0, X, Y are collinear 
points, and If meets BB' in (B, B')jV . Projecting from X, we see 
that XY meets A A' in (P, S)/V, and, projecting from Y, it meets 
BB' in (Q, R)/V. Hence, the intersection Z of PQ, SR is on XY. 
Now, the corresponding sides of the triangles T’U’R, TUQ meet 
at Y, Z, 0 respectively, and therefore, by the converse of Desargues’ 
Theorem TT' passes through F, the intersection of UU', QR. 

Case (ii). U = U' . To construct T we use the lines AV, BV, l' 
for a, b, u respectively. If OB' meets A V in R and UR meets BV in 
S, then .4 'S meets l in T. To construct T' we use the lines A V, BV, l. 
If O'B meets AV in R' , and UR' meets BV in S', then AS 1 meets 
V in T'. Let B'R meet A'S in X, and BR' meet AS’ in X' . Then 
VX is the harmonic conjugate of VU with respect to V A, VB, and 
so is VX'. Hence F, X, X' are collinear. Similarly, if BR' meets 
B'R at L, and AS’ meets A’S at M, we see that UL and UM are 
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both harmonic conjugates of UV with respect to UO, V O'. Hence 
U, L, M are collinear. Now consider the triangles OXT, O'X'T'. 



Corresponding sides intersect at the collinear points M , U, L. The 
triangles are therefore in perspective. But 00' and XX' meet 
only in V. Therefore T, T' , V are collinear. 

Case (iii). The intersection of l and l' is distinct from 0 and U. 
Join O'U, and let this line cut V A, VBin A*, B*. Using O', A*, B*, 
U, we construct T*. By Case (ii) above TT* passes through V, 



and by Case (i) above T*T' passes through V. Hence TT' passes 
through V, and the proof of the projective invariance of Construc- 
tion I is complete. 

We shall return to a study of this construction later. We note 
here that given O and U the point T is determined by A and B in- 
dependently of the order in which they are taken; that is, T is symmetric 
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in A and B. For, having already shown that T is invariant, we may 
use the line VSR as u, and although this interchanges the parts 
played by A and B, we evidently obtain the same point T in the 
only case we need examine, namely A =j= 0, B^O. 

Construction II. In a plane n through l draw three lines a, b, u 
through A, B, U respectively. These lines are to be distinct from l and 
must not pass through one point. Let P be the intersection of u and a, 
Q the intersection of u and b, R the intersection of EQ and a, S the 
intersection of OR and b. Then PS intersects l in a definite point T. 



After justifying this construction we shall show that T depends 
on the points A, B, U, O, E, but not on the construction lines a, b, u. 
As in Construction I, we can select three lines a , b, u to satisfy the 
given conditions. The points P, Q are distinct, and neither lies on l. 
The intersection R of EQ and a is unique. For otherwise EQ = a, 
and Q would coincide with the intersection of a and b, so that a, b, u 
would be concurrent, which is not the case. Again R is distinct from 
0, since R is on EQ, which does not coincide with l, Q not lying on l. 
Hence OR is defined. The intersection S of OR and b is unique. For 
if not, we should have OR — b, so that B — 0, and since Q lies on b, 
we should also have QR = b, so that E = B. But we cannot have 
E = B = 0, for E is distinct from 0. Hence S is unique, and it 
does not coincide with P, because P does not he on b. The fine PS is 
therefore determinate, and since P does not he on l, PS has a unique 
intersection T with l. Finally, T 4= U. For if T = U, S = Q, and 
QR = SR, and therefore 0 = E, which is contrary to hypothesis. 

To show that T is independent of the plane n and of the con- 
struction lines o, b, u, we first consider a number of cases in which 
A, B,U, 0, E are not ah distinct. These are illustrated in Plate IV. 
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(i) If A = O, or B = 0, the construction shows that T = 0. 

(ii) If A = E, we see that T = B. 

(iii) If B = E, we find that T — A. 

(iv) Suppose that A = B, but that A and B are different from 0 
and E. Let O' = E, U' = A = B, A' = O, B' = U. Taking O', V' , 
A', B' in place of the four points 0, U, A, B of Construction I, we 
draw OBS, UQP, BSQ through A', B' y U' respectively. Using 
these lines as the lines a', b' , u' of Construction I it is clear that PS 
meets Z in the invariant point T' of that construction. Hence T = T' 
does not depend on the choice of lines through A, B, U, and is 
projectively invariant with regard to the points O, E, A = B, U. 

Hence, to prove the uniqueness of T and its projective invariance, 
we may confine ourselves to the case in which O, E, U, A, B 
are all distinct, and therefore every line contains at least five 
points. Then T is distinct from 0, A, B and U. 

As in Construction I, we show that in this case P, Q, R, S are 
all distinct points, no three are collinear, and none lies on Z. Now let 
tt' be any plane through Z, and draw fines a', b', u' through A, B, U 
respectively which are not concurrent, and do not coincide with Z. 
Complete the construction for T' . We show that T = T' . 

Case (i). The triangles PQB, P'Q'R' are in perspective, 

since corresponding sides meet at the collinear points E, A, U. 
Similarly, the triangles QRS, Q'R'S' are in perspective, corre- 
sponding sides meeting at the points O, B, E. Hence the triangles 
PQS, P'Q'S' are in perspective, and therefore PS meets P'S' on 
the fine UB. Therefore T = T’ . 

Case (ii). n = n’. If the configuration lies in a space of n > 2 
dimensions, we can establish the uniqueness of T by comparing 
the constructions with a third construction in a plane through Z 
distinct from n = n', using the result of Case (i). If, however, n = 2, 
we must proceed differently. We consider three special cases. 

(iio). a=t=o', b = 6', u = u'. We use the notation of the previous 
figure. The triangles PRS, P'R'S' are in perspective from Q. Now 
PR and P’R’ are distinct, and meet only in A. Also RS =f= R'S'. 
For if R = R', R = R' = A , and then E = A, contrary to hypothesis. 
Similarly, if S = S', OS = OS' = OR = OR', and therefore RR’ 
contains 0, so that 0 = E, which is also contrary to hypothesis. 
Hence if RS = R'S' we must have RR’ = SS', which implies 
E = B, which is also not the case. Hence RS meets R'S' in the one 
point 0. Applying Desargues’ Theorem to the triangles PRS, 
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P'R'S', we see then that PS meets P'S' on OA. That is, 
T= T'. 



(ii&). a = a' , 6=j=6', u — u' . Since &#=&', QaQ', and therefore 
R*R'. Also, RR' = a = QQ'. Now, QR meets Q' R' at E, RS 



meets R'S' at O, and QS meets Q' S' only at B. Hence the triangles 
QRS, Q'R'S' are in perspective. But QQ' and RR' meet only in P. 
Therefore P, S, S' are collinear points, and so T — T' . 


HP I 


16 
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(iic). a = a', b = b' , «#=«'. Let a, b meet in X, let PS, RQ meet 
in Y, and P'S' , R'Q' meet in Y'. Let RS meet u in V, and let R'S' 
meet u' in V. The triangles RQV, R'Q'V' are in perspective, since 
corresponding sides meet at U, O, E respectively. Since RR' and 



QQ' meet only in X, it follows that VV' passes through X. Now, 
X Y is the harmonic conjugate of XV with respect to XA, XB, and 
X Y' is the harmonic con j ugate of XV with respect to the same pair of 
lines. Since XV — XV', X, Y, Y' are collinear points. Now consider 
the triangles PYQ, P'Y'Q'. By what we have just proved, these 
triangles are in perspective from X. Since PQ meets P'Q' only at 
U, and YQ meets Y'Q' only at E, it follows that PY meets P’Y' 
on l. Hence T = T ' . 

Having proved that T = T' in these special cases, we can proceed, 
as in the case of Construction I, to prove that T = T' for any two 
constructions in the one plane which use the same distinct points 
0, E, U, A, B. Since at least four lines distinct from l pass through 
each of the points A, B, U we can pass from the first construction 
to the second by a series of alterations of one of the lines a, b, u. 
At each change T remains invariant, since we have one of the three 
special cases considered above, and finally we reach the lines 
a', b', u' of the second construction. 

We now prove that if there is a projective correspondence such 

that ( 0,E,U,A,B ,...) A (0',E’, U',A',B ', ...) 
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and if T' is the point obtained by applying Construction II to the 
points O', E', U' , A' , B' , then 

(O, E, U, A, B, T) A {O', E', U',A', B', T'). 

We need only consider the case when the two ranges are in per- 
spective from a vertex V. We may suppose that 0, E, U, A, B are 
all distinct, the cases in which A, B coincide with 0 or E, or A — B 
having already been considered. We suppose in the first instance 
that 

(i) 0 = O'. To construct T and T' we use the lines VAA', VBB' 
first as a, b, then as a', b'. Let EA' meet VB in P. We take UP as u, 



meeting a in Q, and QB' then meets l in T. Let E'A meet VB in 
P'. We take U’P’ as u' , meeting a' in Q' , and then Q'B meets l' in T' . 
Let EA' meet E'A in X, BT' meet B'T in F, and let PQ meet 
P'Q' in Z. Finally, denote the intersections of PQ and l', and of 
P'Q' and l, by H' and H respectively. 

Consider the triangles A' PH', AP'H. Since corresponding sides 
intersect at Z, O, X respectively we can deduce that HH' passes 
through V if we prove that Z, 0, X are collinear. But this follows 
at once from the fact that the triangles EPU, E'P'U' are in per- 
spective from V, since corresponding sides of these triangles inter- 
sect at Z, 0, X respectively. The triangles H'QB', HQ'B are now 
seen to be in perspective, vertex V. Since corresponding sides 
intersect at Y, O, Z it follows that 0, X, Y, Z are collinear. It is 
clear that OX is the harmonic conjugate of OV with respect to l 
and V. If TT' meets BB' in K, the harmonic conjugate of OK with 
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respect to l and l' is 0 Y , which coincides with OX. Hence OK = OV, 
and therefore TT' passes through V. 

(ii) Now let U = U’. This case can be deduced from the one just 
proved by observing that if we rename the points 0, E, U, A, B by 


writing 


(0,E,U,A,B) = (U,E,0,B,A), 


and construct T from the points 0, E, U, A, B, we have T — T. 
Thus the projective invariance of T when U = TJ' is covered by the 
previous case. 

Finally, let the lines l, V meet in a point distinct from O and U. 
If l* is the line O'TJ we proceed in two stages, first considering the 
ranges on l and l*, and then the ranges on l* and V , and the proof 
of the projective invariance of T is completed as in the case of 
Construction I. 

The preceding proofs of the invariance of the points T obtained 
by the two constructions can be greatly simplified when n = 2 if 
we may assume that every line contains an unlimited number of 
points. It is then possible to ensure that all the required inter- 
sections in the uniqueness proofs are determinate. Alternatively, 
the assumption that we are not in the special case mentioned 
in §4 leads to simplifications. Our purpose has been to provide 
proofs which are valid when n = 2 without the use of these 
assumptions. 

We conclude this section with some remarks on the various con- 
structions we have discussed. It will be observed that in the 
‘general’ cases of Constructions I and II a set of four coplanar 
points P, Q, R, S enters, with the property that no three of the 
points are collinear. These points form a quadrangle with three pairs 
of opposite sides: PQ, RS; PS, QR ; PR, QS. The respective inter- 
sections Z, X, Y of these pairs of opposite sides are the diagonal 
points of the quadrangle, and the joins of the diagonal points are 
the diagonals. 

If we apply the usual construction for (R, S)/Z we see that this 
point is the intersection of RS and X Y. Hence X, Y, Z are collinear 
if and only if we are in the special case. 

Let l be any line in the plane which is distinct from the sides of 
the quadrangle. It meets the sides in six points which form three 
pairs, each pair corresponding to a pair of opposite sides. We can 
select a set of three points from the six, one belonging to each pair, 
in eight possible ways. These triads fall into pairs, two paired triads 
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making up the whole set of six points. The sides of the quadrangle 
to which a given triad belongs either form a triangle or are con- 
current. In the first case we say the triad is a triangle triad; in the 
second case we say it is a point triad. Thus, in the figure, A, B, C is 
a triangle triad, A ' , B', C' a point triad. If a given triad is a triangle 
triad (point triad) the triad with which it is paired is a point triad 
(triangle triad). If two triads have one point in common they are 
of like type; if they have two points in common they are of opposite 
type. 



It may, of course, happen that the pairs are not all distinct. 
Suppose, for instance, that C = C' (= Z). Then A, B, C may 
be regarded as both a triangle and a point-triad, and so may 
A', B', C ; but A, A', C and B, B' , C, which also have one 
point in common, may be regarded as each consisting of two tri- 
angle triads. 

Our Constructions I and II are, essentially, constructions for 
completing our quadrangular set of six points, given five of them. 
In I, one pair consists of the point V counted twice, and another is 
the pair A, B. The construction is then simply a method for deter- 
mining the third pair of the quadrangular set, given one of its 
points, namely O. The fact that triangle triads are point triads, 
and conversely, explains why the construction for the point T is 
symmetric in A and B. In II, the given points are paired as (O, U) 
and (A, B), and we are given one member E of the third pair. 
The problem is then to construct T so that E, B, U is a point- 
triad. 


246 VI. PROJECTIVE space: synthetic definition 

If we denote the quadrangular set of points which occurs in II by 

Q(OEA, UTB), 

where OEA is a point triad, UTB a triangle triad, and 0 and U, 
E and T, A and B are respectively the intersections with the line 
of the set of the pairs of opposite sides of the quadrangle PQRS, 
we have proved that 

The property of being a quadrangular set Q(OEA, UTB) is in- 
variant under any projectivity. 

6. Reference systems. In an incidence space of n dimensions 
there exists at least one set of n + 1 linearly independent points. We 
call such a set a simplex. We saw in § 1 that there is a point E which 
is not in any face of the simplex A 0 , ...,A n , and we found 2 n+1 — l 
points E itj ik which are the unique intersection of the S n _ k deter- 
mined by the points A ik+i , ...,A in , E with the S k determined by the 
points A io , ...,A ik . The point E t ik does not lie in any face of the 
simplex A ia > ...,A ik of S k . Also, E t — A 0 and E 0 » = E. 



We now construct a further set of \n(n+ 1) points F i} 
(i,j = 0, . . ., n; i 4=j). Since A i and A i are distinct, and E i} is distinct 
from each of these points, the harmonic conjugate of E tj with respect 
to A { and A i is uniquely determined. It is always different from A i 
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and Ap and coincides with Ep if and only if we are in the special case 
mentioned in § 4. We denote this harmonic conjugate by F i} = F }i . 

To construct consider the plane A i A j A k through A t Ap 
and in this draw A { E jk , AjA k , E {J A k through A p Ap E {j respec- 
tively. These lines form the triangle A k E ijk E jk . Since A t A k and 
A } E iJk meet in E ik , it follows that E ik E jk passes through (A t A i )IE i p 
that is, through F t p This is true for all values of k. 



Suppose, first, that we are not in the special case. Then E ip E jk , 
E ki are not colhnear. The triangles A t AjA k , E jk E ki E ij are in per- 
spective from E ijk . Hence the intersections of corresponding sides 
are collinear; that is, F jk , F ki , F ti are collinear. If we are in the special 
case, E t p E jk , E ki are colhnear, and so F i} = Ey, F jk = E jk and 
F ki — E ki are again colhnear. 

We now prove that the %n(n+ 1) points F i} all he in an S n _ v 
This result has just been proved when n — 2, and we prove the 
theorem by induction on n. We suppose that the theorem is true for 
the configuration in E°, the face opposite the vertex A 0 . That is, 
we assume that the points Fy ( i,j = 1, ...,») he in an S n _ 2 . If ah 
the points F M he in this S n _ 2 the result is evident. Suppose that 
F 01 is not in this S n _ r Then F 01 and S n _ 2 determine a prime, S n _ t 
say. Consider the point F oi . Since the points F oi , F 01 , F u are colhnear, 
and F ov F 1{ are in (Sf n-1 and are distinct, the point F oi hes in S n _ t 
also, and the theorem is proved. 

It is easily seen that the points F {j cannot all he in an S k (k < n — 1). 
For if they do, and A 0 is a vertex of the simplex which does not lie 
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in this S k , the vertex A 0 and S k determine an S k+1 . This S k+1 contains 
A 0 and F oi , which is distinct from A 0 . Hence S k+1 contains the Une 
^4 0 ^4 <> and in particular it contains A t . Therefore S k+l (k+l<n) 
contains the set A 0 ,...,A n , contradicting our assumption that 
these points are linearly independent. The above results still hold 
in the special case. 

The points E iaik were constructed by means of the vertices 
A 0 , ...,A n and the point E. We now prove, as a converse to the above 
result, that if we are given any S n _ 1 not containing any vertices A t , 
we can find a point E such that all the F i} constructed from E lie' 
in this S n _ v Let A { A } - meet this S n _ 1 in the point F i} . If 

Eif = (AiA^/Fy, 

we show that the primes 

A 0 , ..., Af_ v A i+1 , ..., Aj_ v A f+1) ..., A n , E^ 

all meet in a unique point E. (In the special case we define E {j as the 
point where A i A i meets S n _ v ) 

First consider the case n = 2. If we are in the special case, for 
which E {j = (A i Aj)/F ii = F i} , we construct E i} by drawing ^4 f ^4 fc , 
AjA k , F i:j Fj k F ki through A is Aj, F i} respectively to form the triangle 
A k F ki F jk . If AiF kj , A } F ki meet in E m then A k E ijk passes through 
(A i A j )/F ij = E i} . Since F ki = E ki , F kj = E kj , E ijk clearly satisfies 
the requirements of our theorem. If we are not in the special case, 
E i} + F i} , and we have 

■A-u = (EikFuc)IAi, A k — (E jk F ik )IAj. 

Hence it follows that the lines A i A j , E ik E jk , F ik F jk are concurrent, 
that is, E ik E jk passes through F tj . Consider the triangles A i A j A k , 
E }k E ki E i} . Corresponding sides meet in the collinear points F jk , 
F ki , F tj . Hence, by the converse of Desargues’ Theorem the tri- 
angles are in perspective from a point E ijk , and this point satisfies 
the requirements of the theorem. It is easily verified that if E i}k 
lay on any side of the triangle A i A^A k> the line F ik F u F i) w o«ld 
contain a vertex of the triangle, contrary to hypothesis. 

We now suppose that the theorem is true for each face L i and 
construct the points E o i _ li+l n . If i =t=j, the lines Aj E o i _ li+l n 
and hay e th e point in com - 

mon. Hence the lines A i E 0 i+1 _ _ n and AjE^ n intersect. 

This is true for all values of i and j. Since all these lines cannot lie 
in a plane they must all pass through a point E. This point does not 
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lie in any of the E { , and from E and the vertices A 0 , ...,A n we 
return to the points F ip if we carry out the constructions given at 
the beginning of the section. 

The configuration discussed above will be called a system of 
reference in the space. 

We have seen [§ 3, Th. I] that a projectivity can be established 
between two ranges on lines l, l' so that three distinct assigned 
points on l correspond to three distinct assigned points on V. We 
also saw, later on, that this theorem cannot be extended to any four 
pairs of corresponding points. But we do not know, as yet, whether 
the projectivity is uniquely determined by the assignment of three 
pairs of corresponding points. 

We may therefore establish a correspondence between ranges on 
any two fines A t Aj and A h A k of our reference system so that 
A t , Aj, E i} correspond to A h , A k , E hk respectively. We wish to do 
this, however, in such a way that if A { A p A h A k and A t A m are three 
fines between which such correspondences are established, and p u 
corresponds to P hk , and also P hk corresponds to P lm> then P tj also 
corresponds to P lm (P ip P hk , P lm being, of course, points on A i A j> 
A h A k and A l A m respectively). We do this as follows. 

Let T] k denote the operation of projecting A t A } into A { A k from 
the vertex F jk , which is E jk in the special case. The result of following 
the projection T) k by the projection T kl or by the projection T\ will 
be denoted by Tj k T l kl , or by T) k T\ b respectively. We note that 
T) k T% c is not defined unless 

(i,k) — (a,b) or (i,k) = (b,a). 

Since the operation Tj k T kj consists of projecting A t A } into A i A k 
and back again, we write Tj k T kj = 1, where, as is usual, 1 denotes 
the identical transformation. 

Now, we can pass from the points (A it A p E i} ) to the points 
(A h , A k , E hk ) by a series of projections T% c , and we wish to show that 
the correspondence between the ranges does not depend on the series of 
projections chosen. We first consider the transformation between 
points on A t A t given by 

(i) T) k Tfy T{ t ~ H i} . Let P tj be any point on A { A p and let 
F Jk P i j meet A t A k in P ki ; let F ij P ki meet A } A k in P jk ; and let 
P ik P jk meet A { Aj in P' ip H p is a transformation of the points of 
A i A i into points of A { A p and we see at once that H i} A t — A p and 
H^Aj = A { ; also H i} E tj = E i} and H tj F 4j — F {j . We now consider 
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the case of any other point of It is verified at once that 

Py, P ki , P jk , P' i} are then all distinct. 



Let P ki Pij meet P jk P'ij in K. We first prove that A k K passes 
through E i} . We may construct the harmonic conjugate of F i} 
with respect to A it Aj by taking the lines A i A k , Aj A k , F^P^P^ 
through Af, A jy F i} . It follows that A { P jk and A } P ki meet on the 
line joining A k to (A t , A^/ F ijt that is, to Ey. Now consider the 
triangles A i A j A k and P jk P ki K (which are proper triangles, even 
in the special case). Corresponding sides meet in F jk , F ki , F ijy and 
hence, by the converse of Desargues’ Theorem, A i P jk and AjP ki 
meet on A k K, which is therefore the same as A k E ij . 

The triangles E ik E jk A k and P' i} P i} K are in perspective, since 
corresponding sides intersect in the collinear points F jk , F ki , Pi,- 
Hence E ik P' i} and E^P^ meet on KA k = E i} A k . Also, the triangles 
E ik E jk A k and P t} P' i} K are in perspective, since corresponding sides 
meet in Pjk > P U> Pijy and hence E^Py meets E jk P' {j on KA k = E {j A k . 
From these results we deduce a simple construction for the trans- 
formation H i} : let the join of P tj to E jk meet A k E i} in Q ; then E ik Q 
meets A it in P' i} . We also deduce the following properties of H ti : 

(®) P%i Pji’> 

(b) (V-l. 

Further, we have 

PjkPik = Pjk Pki = P)kP 1 ijP{iP}k ~ ^ij Pjk> 

(c) T\ k H ik — H i} T) k . 


that is, 
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Finally, if A h is any other vertex distinct from A { , A p A k , and 
the plane A i A j A k is projected from the point F hk on to the plane 
A { AjA h , the points A it A } , A k , P ip F jk , F ip F ik project, respectively, 
into the points, A iy A p A h , P ip F jh , F ip F ih . Hence the configura- 
tion of lines in A t A ) A k which determines the transformation H ti 
is projected into the configuration of fines in A i A } A h which de- 
termines the transformation in that plane. Since P' i} is projected 
into P\ p the transformation H i} is independent of the third vertex 
A k by which it is defined. 

We may note, in passing, that if we are not in the special case, 
so that E i} 4= F ip P' i} can also be defined as (E ip F^jPy. 

A second type of transformation is T) k T kh . We prove that 


(ii) Tj k Ti h 


rpi 
1 jh • 


Let 


T)k(Pij) 


P ik , and T l kh (P ik ) = P ih . We 


wish to prove that T^Py) = P ih - We assume that P^^Ey, since 
in that case the result is trivial. Now, the triangles Pi j Pik Pih and 
E i:! E ik E ih are in perspective from A t . Hence corresponding sides 
meet in colfinear points. Two of these points are F jk> F kh . Hence 
F jk F kh passes through the intersection of Pi} Pih and E^ E ih . But 
E^ E ih and F jk F kh meet only in F jh . Therefore P i} P ih passes through 


F jh . That is 


P}h(Pij) — Pih • 



Finally, if i, j, h, k are all different, we prove that 

(iii) Tj k T% h = ThTft. Both transformations in question take 
ranges on A t A } into ranges on A h A k so that the points A p E ip A } 
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are transformed into the points A h , E hk , A k respectively. Let 
Pff be any other point of A { A p and let 

Fjk ;(P{j) = Pik> Pih(Ptk) ~ Pfik- 

If T k( p ij) = P)h> we wish to prove that T%(P jh ) = P hk . 

Now, P i} P ih passes through F ih , and P ik P hk passes through F ih . 
Hence these two lines (which are distinct if Pn ±A {) as we have 
supposed) lie in a plane. Therefore PijPik and PjhPhk meet. Since 
one of these lines lies in the plane A i A j A k , and the other in A h AjA k , 
they can only meet on the line of intersection, which is given by 
AjA k . But since Pa =t=Aj the line PijPik meets A j A k only in Fjk- 
Hence PjhPhk Passes through F jk . That is, T%(P jh ) = P hk . 

We use these three types of transformation to prove the main 
theorem of this section. Consider two series of T\ k transformations 
which transform a range on A p A q into a range on A r A g so that 
A p , E pq , A q transform into A r , E rs , A g respectively. We wish to 
prove that the resultant projectivities are identical. Let one series 
of transformations be 


mp ml 
- 1 ql * mn 9 9 • 

mp mV 

■* qV ± m'n 9 9 • • • 


and the other T% v T l m , n 

Since ( T l m . n . ...y 1 = (... T l n , m . Tf q ) 

it is sufficient to prove 

Theorem I. The resultant of any series of T\ k transformations 
which transforms the line A a A b into itself in such a way that A a , A b 
(and therefore E^) correspond to themselves must be the identity trans- 
formation. 

If there are only two transformations in the series, they must be 
either T bc T% or T b ac T% a in order that A a and A b should be self- 
corresponding. As we have seen, each of these transformations is 
the identity transformation. 

If there are more than two transformations in the series, consider 
any sequence of three transformations. If two consecutive trans- 
formations in the selected three involve the same upper index we 
can substitute one transformation for the pair, using the result of 
(ii) above. Hence we need only consider a sequence such as 

mi rpk ml 
jk il 

and consider the various possibilities. 

(a) l =j, m — i. By (i) above the sequence can be replaced by 
H {j . If this is preceded by any transformation of the series, this 
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transformation must be either or In either case we may use 
the result of (i) (c) to move the H it transformation towards the head 
of the series of transformations. When it reaches this position it 
must be an H ab transformation. 

(6) l = j, m =j= i. Then 

T\ k T% T{ m = T) k T\ m Tf } (by (iii)) 

= Vfi (by(ii)). 

(c) If 1 4= j, we have, similarly, 

T\ k n 

^ km T ii T ik T km (by (iii)) 

= T^T\ m , (by (ii)). 

Hence, in all three cases, we can reduce the number of transforma- 
tions in the sequence. Finally, the whole series of transformations 
must be reducible to one of the following forms: 

W, (H ab m c) (H ab rT% c Tl b (<r = 0orl). 

Only the first of these transformations transforms A a into A a and 
A b into A b , and then only when cr = 0. Hence we can reduce the 
series of transformations to the identity transformation, and the 
theorem is proved. 

This theorem enables us to establish a unique projectivity between 
ranges on any two lines A a A b and A { A p in which A a , i? a6 , A b 
correspond respectively to A { , E ip A p Such a projectivity will be 
used later and we shall denote it by T p . Then T% b = 1 , and T b ^ = H^. 

7. The algebra of points on a line. In § 5 we discussed two 
protectively invariant constructions. We now use these to construct 
a field, which will be used in a later section to introduce coordinates 
into our space. 

The constructions depend on the choice of three distinct points 
0, E, U on a line l. We call the process of choosing these points the 
establishing of a scale on l. Our constructions determine points on l 
from any two points A, B, neither of which coincides with U. The 
points determined are always distinct from U. We have, therefore, 
two laws of composition for the points of l other than U. We 
consider these laws of composition separately. 

Construction I is symmetric in A and B, and we denote the 
derived point T by A + B. We show that this law of addition is 
associative. 
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Let A, B, C be three points on l. The result is evident if any one 
of these points is at 0, so we confine ourselves to the case in which 
they are all distinct from 0. Perform the usual construction for 



T = A + B, using the notation of § 5. To construct T +C we draw 
TSP through T and let OS meet UQ in Q', which is necessarily 
distinct from S. If CQ’ meets US in S', then PS' meets l in T, 
where T+C = T. 


Using the same figure, QS' meets l in T' = B + C. We construct 
A + T' by means of the lines ARP, T'S'Q, and ORQ. Then PS' 
meets l in A + T'. But this is T. Hence 

A + {B+C) = (A + B) + C. 

We have already seen that if A is any point on l distinct from U, 

A + O — A. 

We show now that given any two points A, Boil (distinct from U), 
there is a point X of l, also distinct from U, such that 

A + X = B. 

It is sufficient to show that there is a point Y such that 

A+Y = 0. 

For if Y has this property, the point 

X= Y + B 

satisfies our requirements. Let A' be the point ( O , U)/A. Then, in 
the usual construction for harmonic conjugate, as in Fig. 32, the 
lines ARS, A'PQ are drawn through A and A', and ORQ meets 
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them in R, Q respectively. Also, UQ, UR meet them in 8, P respec- 
tively, and PS meets l in A+A'. Hence 

A+A' = 0, 

and A' is the required point Y. Since A + Y = 0, we shall usually 



denote the point Y by —A, and write B — A for B + { — A). We 
have proved 

Theorem I. The points of the line l other than U form a com- 
mutative group under the addition law. 



We now consider Construction II of § 5, which also gives us a 
law of composition for the points of l other than U. We regard this 
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as a multiplication, and write T = A- B. Let us prove that multi- 
plication is associative. It is easily verified that 

A'(B-C) = (A'B)'C 

if any one of the points A, B, C coincides with 0 or E, and we 
therefore suppose that none of the points A, B, G coincides with 0 
or with E. Now, draw two lines OU’, UU' through 0 and U , inter- 
secting in U' . Through A draw ARP, meeting OU' , UU' in R 
and P respectively, and let ER meet UU' in Q, QB meet OU' in 8, 
ES meet UU' in Q' , and CQ' meet OU' in S'. Let PS meet l in T, 
QS' meet l in T', and let PS' meet l in T. 

Considering the quadrangle PQSR, we see that T = A- B, and 
from the quadrangle PQ'S'S that T = T'G. On the other hand, 
the quadrangle QQ'S'S shows that T' = B‘C, and the quadrangle 
PQS'R that T = A • T'. Hence 

A-(B-C) = {A- ByC. 

Multiplication is also distributive over addition. We prove that 

C-{A + B) = C-A + C-B. 



In a plane through l take a point Z, and then a point Y on the 
line ZU. Join YB, Y A and ZC. Let YE meet ZC in C', and let 
OC meet YB and Y A respectively in P' and P. Let ZP' and ZP 
respectively meet OU in B' and A', and let UP and UP' meet 
ZP' and ZP respectively in R' and R, these same lines meeting 
YP' and YP respectively in Q' and Q. 
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Since corresponding sides of the triangles PQR, P'Q'R' meet in 
the collinear points U, Z, Y, the converse of Desargues’ Theorem 
shows that QQ' passes through the intersection F of PP' and RR' . 
Also, if C'Y , G'Z meet QQ' and RR' respectively in H and K, then 
since corresponding sides of the triangles YHQ' and ZKR' meet in 
the collinear points F, P' y C', it follows that KH passes through 
the intersection U of YZ and Q'R'. 

We can now prove our theorem. Denoting the intersections of 
QQ' and RR' respectively with l by S and M, the quadrangle 
QPQ'P' shows that S = A + B, and the quadrangle CFHK that 
M — C • S = C-(A + B). On the other hand, consideration of the 
quadrangles C'PYZ, C’P'YZ and P'RPR' respectively tells us 
that A' = C'A, B' = C‘B and M = A' + B' respectively. We have 
therefore shown that 

C-(A + B) = C-A + C-B. 

In a similar fashion it may be proved that 

(A + J3)*0 = A-C + B-C. 

We have already noted in § 5 that 

A-E = A = E-A. 

Hence the point E acts as unity under multiplication. Finally, if 
A is any point on l distinct from O (and U) we can construct an 
inverse A* such that 

A- A* = E = A*- A. 



The construction of A* is evident from the diagram, as is the proof 
of the properties of A*. 


HP I 


17 


258 VI. PROJECTIVE space: synthetic definition 

These results show that the set of points on l, excluding U, oan 
have two laws of composition associated with them, one a law of 
addition, the other a multiplication law, and that under these laws 
the points form a field. In this field 0 is the zero, E the unity. We 
sum these results in 

Theorem II. Under the two laws of composition defined by Con- 
structions I and II the points of the line l other than U form afield. 

Before passing on it is convenient to prove here a result, which 
we shall use later, concerning the construction of the inverse A* 
of A. 



Let RS and QP meet in Z, Q8 and RP in X, and let ZX meet l 
in F. Since ZE is the harmonic conjugate of ZX with respect to 
ZP and ZR, F — (0, U)jE. Again, XS, XR are harmonically con- 
jugate with respect to XE, XZ. Hence A* and A are harmonically 
conjugate with respect to E and F. In other words, we have (if we 
are not in the special case) 

Theorem III. If F is the harmonic conjugate of E with respect to 
0 and U, the inverse of A may be defined as the harmonic conjugate 
of A with respect to E and F. 

Returning now to the consideration of the field we have con- 
structed, we note that it has been defined by selecting on a line l 
an ordered set of three points 0 , E, U. Let us denote the field which 
is then defined by K(0, E, U). If O', E' , U' be any other ordered 
set of points on a line l', we may similarly construct a field 
K(0', E', V). Now, there is a projectivity which relates a range on 
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l to a range on V, 0, E, U corresponding to O', E', U' respectively. 
If points P, Q, ... of the first range (other than U) correspond to 
points P', Q', ... of the second range, then P + Q corresponds to 
P' + Q', and P* Q corresponds to P' • Q' . For we have proved that 
Constructions I and II yield projectively invariant points. Hence, 
there is a one-to-one correspondence between the elements of the 
fields K(0, E, U) and K(0', E ' , V) which is, in fact, an isomor- 
phism. 

In particular, there is a projectivity which transforms l into 
itself, 0, E, U being self-corresponding points. This defines an 
automorphism of K(0, E, U) which is the identical correspondence 
only if the projectivity is the identical projectivity. 

Since all the fields we can construct by establishing a scale on 
any line l in S n are isomorphic, we can call any field isomorphic 
with them the field of the geometry. There is, of course, another field, 
for which addition is defined, as above, by Construction I, and 
multiplication by interchanging the roles of A and B in Construc- 
tion II. This second field is the skew-isomorph [V, § 1] of the field of 
the geometry. 

8. The representation of the incidence space as a P^fST). We 

are now in a position to introduce coordinate systems into our space 
S n . We first choose a reference system [§ 6], and use the notation 
already introduced. On any line A { Aj there are three points A ( , 
A )y E ti . We choose these as O, U, E, and define, as in the preceding 
section, a field by means of these points. All the fields obtained in 
this way are isomorphic, and the transformations T% d defined in 
§ 6 establish, in a unique fashion, an isomorphism between the fields 
K(A a> E^, A b ) and K{A C , E cd , A d ). 

This means that if K denotes the field of our geometry, we have 
established without ambiguity, for every choice of a, b, a corre- 
spondence between the points of A a A b (other than A b ) and the 
elements of K in which A a corresponds to zero. The element of K 
which corresponds to the point P on A a A b will be called the co- 
ordinate of P in the scale (A a , E^, A b ); or more briefly, when there is 
no risk of ambiguity, the coordinate of P. The coordinate of E^ 
is always 1. 

If P is distinct from both A a and A b , and a is the coordinate of 
P in the scale {A a , E ab , A b ), the coordinate a* of P in the scale 
(A 6 , E ba , A a ) is found as follows, when we are not in the special case. 
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The transformation takes P into P*, where P and P* are 
harmonically conjugate with respect to E ab and F^ [§ 6], and of 
course this transformation takes P* into P. But [§ 7, Th. Ill], if 
P and P* are harmonically conjugate with respect to E^ and F ab , 
the corresponding elements of K are inverses. Since the coordinate 
of P* in the scale (A a , E ab , A b ) is the coordinate of P in the scale 
(A b , E ba , A a ), the coordinate of P in this last scale is a* = a -1 . 
Hence 


Theorem I. If P is any point of A a A b distinct from A a and A b , 
and a, a* are its coordinates in the respective scales ( A a , E ab , A b ) and 
(A b ,E ba ,A a ), then aa* = 1. 

This theorem is also true in the special case, which has been 
excepted from the foregoing proof. To see this we first note that in 
the special case the configuration of Fig. 36 has the following 
properties: P, E, S are collinear points, and lie on the sides of the 
triangle OZU, therefore OP, ZE, US are concurrent. Let the point 
of concurrence be T. Also, Z, E, X are collinear. Now identify this 
configuration with that of Fig. 29, identifying 0, U, Z, T with 
A { , A j} A k , E ijk respectively. Then E, P, S are identified with E ijy 
Ej k , E ki respectively. We saw in § 6 that if P i} is any point on A i A i , 
the point P'a which corresponds to Pit in the transformation H i} 
which makes A { , E ijy A i correspond to A p E jit A { respectively is 
obtained thus: E jk P i:j meets E ik P' i} on A k E ij . Now, returning to 
Fig. 36, with the above identification, the point A' corresponding 
to A is constructed as follows: let AP meet ZE in X, then XS 
meets l in A'. But this shows that A* = A'. Hence it follows that 
Theorem I remains true in the special case. 

We now consider any point P of the given space, and we suppose, 
in the first instance, that P is not in any face of the simplex A 0 , ...,A n . 
If we use P as we used E in the constructions of § 6 we obtain a 
series of points 


Pi> Pi}) Pijk) •••» 


where P i o-ik lies in the space of k dimensions determined by 
A io , . ..,A ik , but not in any face of the simplex formed by these k + 1 
points. Since P i} — P it is on A t A t and distinct from both A i and A } , 
it has a coordinate in each of the scales (A it E ijy A } ) and (A jy E }i , A { ). 
Let these coordinates b ep }i andp^ respectively. Then, by the above 
theorem, 


PaPu = 1. 
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Theorem II.* If i, j, k are all distinct, 

PikPkjPjt = !• 



Let F jk P ki meet A { A, in Q ki , and let F ki Q ki meet AjA k in R H . 
Clearly T%P ki — R ki , and therefore the coordinate of R ki in the 
scale (A jy E jk ,A k ) is p ik . Similarly, if P {j F ki meets A j A k in Q i} , 
then TjiPy = Q ij} and the coordinate of Q i} in the scale (A Jy E jk , A k ) 
is Pa- 

With these preliminary remarks we prove the 

Lemma. Q ki E jk intersects Pi)P Jk on A i A k . 

The harmonic conjugate E jk of F jk with respect to A f and A k can 
be constructed by drawing the lines AjAp A k A i and Fjk QkiPki' 
Hence A i Fki and A k Q ki intersect in a point X of A i E jk . 

Now consider the triangles E ik P ik A i and Q ki P {j A k . Corre- 
sponding sides intersect at the points P ijk , X, Aj. These points are 
collinear, and the triangles are therefore in perspective. The vertex 
of perspective Y lies on A { A k , and is the intersection of Q ki E jk 
and FijPjk- This proves the lemma. 

We now take (Ap E jk , A k ) as our scale on A } A k , and construct the 
product R ki 'P jk , as in Construction II. We use the quadrangle 
Fki FQki Fjj- The line A j A k meets the pairs of opposite sides of this 

* We are indebted to Mr G. Kreisel of Trinity College, Cambridge, for 
assistance with the proofs of Theorems II and III, which do not involve 
the assumption of Pappus’ Theorem. 
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quadrangle in the pairs of points A }> A k ; R ki , P ik \ and E jk , Q i} ; 
and we find that consequently 

' Pjk — Qa- 

Therefore p ik p kj = p i} , 

or, by Th. I, 

PikPkiPa - PijPji = 1. 

This proves the theorem. 

Now, if x 0 is any non-zero element of K, let 

*<= 2 ^ 0*0 (* = 1 fo + 0 ). 

Then, since PyPjoPoi = 1» 

that is, p i} = PioiPjo)- 1 , 

we see that p i} — x&j 1 . 

The set of right-hand equivalent (n+ 1) -tuples 

(x 0 ,x v ...,x n ) 

determined by different x 0 ’s are the coordinates of P with regard to 
the given reference system. 

We now suppose that P lies in the space defined by A io> ...,A ik , 
but does not lie in any face of the ^-simplex defined by these points. 
If we complete the reference system in this space by means of the 
point ik , we obtain, as above, a set of equivalent (&+ l)-tuples 
(x i() , ...,x it ) as coordinates of P with respect to the system of 
reference in the fc-space. The coordinates (y 0 , . . . , y n ) of P with regard 
to the system of reference in S n are defined as follows: 

Vi = (i = i 0 ,...,i k ), 

y t = 0 (»#t 0 , 

We see that this definition is consistent with our definition of the 
coordinates of P when P does not lie in any face of the simplex 
A 0 , ...,A n . In fact, if the coordinates of any point P are (y 0 , . . . , y n ), 
a necessary and sufficient condition that = 0 is that, for each 
j 4= i, the space defined by the set of points 

Aq, ..., A{_ j, ..., Aj — j, Aj+i, ..., A n , P 

intersects the line A i A i at the point A } , or that this set is linearly 
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dependent. For the intersebtion is the point and in the scale 
(A p E {j , A t ) the coordinate of P {j is p if = ViHj 1 . If P tj = A p 

Pu = ViVT 1 “ °- 

and therefore y { = 0. Also, a necessary and sufficient condition that 
P lies in the space defined by A io , . . . , A ik , and not in any face of this 
simplex, is that the points 

Aq) • ••, Af_i, Af+i, . .., Aj_ p Aj+i, . .., A n , P (i 4= i 0 , h) 

are linearly independent and determine a space which intersects 
A { Aj in A p where j takes the values (i 0 , ..., i fc ). Also, if P is not in 
the space A ik+1 ...A in the space A ik+1 ... A in P meets the space 
A ia ...A ik in a point P i<t ik whose coordinates are (z 0 , ...,z n ), 

^bere ^ ^ ^ __ ^ ^ = 0 {j = k+ 1, 

We leave the proofs of these last results to the reader. 

We show now that any set of right-hand equivalent (n+\)-tuples 
corresponds to one and only one point P of the space. 

Let (x 0 , . . . , x n ) be any (n + 1) -tuple of the set. If 


Xiu , . = 


X in = °> 


t 'ii+ 1 ~ — 

P must lie in the fe-space defined by A io , ...,A ik . We may therefore 
suppose that no x t = 0. 

On A 0 A t take the scale (A 0 , E oi , A t ), and let P t be the point on 
this fine which has the coordinate x^ 1 in this scale. Then, by 
hypothesis, P i 4=A i (i = 0, ...,n). The n primes defined by the points 
A v ...yA^p A i+1 , ...,A n , Pi have a single point P in common. We 
prove this by induction on n. 

When n — 2, the result is trivial, since two lines intersect in a 
point. We assume that the theorem is true for an S n _ v in particular 
for that defined by A 0 , ...,A n _ v Then the n— 1 spaces defined 

^ A j, . . . , Aj_ k , Aj+p . . ., A n , Pj (i — 1 , . . ., w 1) 

intersect in a space S r which, by hypothesis, meets the S n _ x defined 
by A 0 , ..., A n _ x in a single point P; for meets the 8 n _ x defined 
by A 0 , . . . , A n _ x in the S n _ z defined by A x , . . . , A t _ v A i+X , . . . , A n _ v P t . 
This point R does not he in any face of the simplex A 0 , ...,A n _ v 
Again, A n R does not he in the S*_ x defined by A v ...,A n _ v P n . It 
therefore meets it in a single point, and this is the required 
point P. It is evident that the coordinates of P are ( x 0 , ..., x n ). 



204 VI. projective space: synthetic definition 

The condition for the collinearity of three points is our next 
subject. We prove 

Theorem III. A necessary and sufficient condition that the points 
P = (x 0 , ...,x n ), Q = (y 0 , ...,y n ) and R = (z 0 , ...,z n ) be collinear is 
that the rank of the matrix 


/* 0 

i/o 

*o\ 


r 

2/i 


(1) 

\*» 

Vn 




be less than three. 

If the rank is one, the three (n + 1) -tuples given by the columns 
of this matrix are necessarily equivalent, and the three points 
P, Q, R coincide. If, conversely, they coincide, the rank is clearly 
one. We may therefore omit this case. 

Similarly, in proving the necessity of the condition, we may omit 
the case when two of the points coincide. 

Since the rank of the matrix is now two, at least, there are at 
least two linearly independent rows. Without loss of generality we 
may assume that the first two rows are linearly independent. A neces- 
sary and sufficient condition that the matrix is of rank two is that 
every other row of the matrix is linearly dependent on the first 
two rows. 

(1) We first prove that the condition in the enunciation of 
Theorem III is necessary. By what we have just said, it is enough 
to prove that if P, Q, R are distinct collinear points such that the 
left-hand vectors (x 0 ,y 0 ,z 0 ), (x lt y v z t ) are linearly independent, 
then any other row of the matrix ( 1 ), regarded as a left-hand vector, 
is linearly dependent on these two. The first step is to show that 
this simplified form of the hypothesis implies that the line PQR 
does not meet the space determined by A 2 , ..., A^, A i+1 , ..., A n 
(i > 2). If P, Q, R all he in this space, then 

x 0 = x i = = 0; y 0 = yi = yi = 0; z 0 = z 1 = z i = 0, 

and therefore the first two rows of the matrix (1) consist of zeros, 
which is contrary to hypothesis. Next, suppose that the line PQR 
meets the space determined by A 2 , ...,A { _ lt A i+1 , ...,A n in a single 
point. Then the S n _ 2 containing this space and the line PQR meets 
the plane A 0 A 1 A { in a single point, S, say. If P lies in the space 
A 2 ... Af_iAf + i ... A n , 

x n = X, = Xi = 0 , 
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and if it does not, P 01i is determinate, and equal to S. A similar 
argument can be applied to Q and to R. Hence in all cases we have 

x } = tf A, y j = t } /i, Zj = tjV (j = 0, 1,*), 

where S is ( t 0 , t lt 0 0, t it 0, ..., 0). Therefore 

/* o Vo 2 o\ = (to 0 0\ /A n v\ 

(xj y x zA 0 0 0 0 0), 

\ x i Vi *J \h 0 0 / \ 0 0 0 / 

and is therefore a matrix of rank one. We conclude that the first 
two rows of the matrix (1) are linearly dependent, which is contrary 
to hypothesis. 

Since the linear independence of ( x 0 ,y 0 ,z 0 ) and (x v y v z x ) has 
been shown to imply that PQR does not meet the space determined 
by A 2 ,A 3 , ..., A { _ v A i+1 , ...,A n , it follows that the collinear points 
PoH, Qou> R ou are distinct. To simplify the notation we denote these 
points by P', Q' , R' . We have now to show that the collinearity of 
these three points implies that the ith row of the matrix (1) is 
linearly dependent on the first two rows. 

If the line P'Q'R' lies on a side of the triangle A 0 A 1 A i , 


x i = Vi = Zj - ° 

and the matrix 


(j 

= 0, 

O 

II 

I—- 

O 

II 

[ x o 

Vo 

2 o\ 

Xi 

Vx 

21 

\ x i 

Vi 

zj 


( 2 ) 


is of rank two. If this is not the case, but P'Q'R' passes through one 
vertex only of the simplex AqA^A^ say the vertex A } , let us first 
suppose that P', Q\ R' are all different from A } . Then 

x h x k 1 = VhVk 1 = z h*k l , 

where ( j,h,Jc ) is some derangement of (0, 1, i); for we obtain the 
same point on the side A k A h if we project P', Q' or R' from Ay 
Thus the matrix 

( x ) x k x ViVk 1 z j z k x \ 
x h x k x VhVk 1 

1 1 1 / 


has two rows which are linearly dependent, as left-hand vectors; 
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its rank is therefore less than three. But this matrix is the 
product 

l x i Vi Z i\ fck 1 0 0\ 

Vh z hJ I 0 Vk 1 0 ]» 

W Vk Z J \ 0 0 Z k V 

that is, the product of the matrix (2), with its rows possibly 
deranged, by a regular matrix. Hence the rank of the matrix (2) 
must be less than three. 

If, on the other hand, P' = A jt 

jxj Vi *A = /I Vi *A 

I ** y» *h \ o Vh Zft), 

W Vk Z J \o y k zJ 

where y h y k 1 = z^ 1 , 

and we prove as before that the matrix (2) is of rank two. 

We are therefore left with the case in which the line P'Q'R' does 
not pass through a vertex of A 0 A 1 A i . Let V = P'Q'R' meet the 
sides of AgAj^Af in the points U 0 , U x , U 0 and let X be any point on V . 
In the first instance let X be distinct from U 0 , TJ X and 17*. Let A 0 X 



meet A x A t in X l{ , and A { X meet A 0 A X in X 01 . Let X li U 1 meet 
A 0 A X in N, E U U X meet A 0 A 1 in L, and Kt L meet A X A { in M. 
Then, in this construction, the points U 0 , U lt U it L, M depend only 
on l', and not on the point X chosen on it. 


207 


8. THE INCIDENCE SPACE AS A *>£(*:) 

Taking the scale {A x , E xi , A { ) on A x A t we construct the product 

M'X U as follows. Through M, X xi , E u we draw the lines ifF w , 

X Xi U x and E xi U x respectively. Then, by the usual construction, 

F oi N meets A x A t in the required point 8 = M-X xi . If we denote 

the elements of the ground field K corresponding to the points Y 

of A x At by y we have 

1 * J 9 mx iX = 8. 

Passing from A x A t to A X A 0 by the transformation T\l, it follows 
that in the scale (A v E xo , A 0 ) the product L-X' xi = N. If, in the 
same scale, we construct N + X ox , using the quadrangle XU l A i X li) 
we see that N + X 01 = U t . That is 

mx a +x ox = Ui. 

It follows, that if X is the point (£ 0 , 

w£i + £o = «*£i- (3) 

We note that this formula holds when X is at U 0 , U x or U t . For, if 
X = U 0 , N = U t and L-X' xi = U it so that = u^, and = 0. 
If X = U iy ^ = 0, and u t = x ox . Finally, if X = U x , let F xi U x meet 
A 0 A X in U x . Then L and U x are harmonically conjugate with 
respect to A 0 and A x . But the elements of K which correspond to 
L and to U x in the scale {A x , E xo , A 0 ) are m and u x = x 0{ respectively. 
Hence m = — x Qi , and £, x = 0, so that once again the formula is 
satisfied. 

Since the formula (3) is true for all points on V, it is true, in 
particular, when X coincides with P', Q' and R' respectively. The 
rows of the matrix (2) are therefore linearly dependent, and so 
once again it is of rank two. 

Applying this result for i = 2, . . ., n we deduce that if P, Q, R are 
collinear the rows of the matrix 




are all linearly dependent on the first two rows. The rank of this 
matrix is therefore two. We have proved the necessity of our 
theorem, and we now prove its sufficiency. 

(2) Let us suppose then that the rank of the matrix is less than 
three. We must show that P, Q, R are collinear. If the (n + l)-tuple 
defined by any column is equivalent to that defined by any other 
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column the points P, Q, R are not distinct, and the theorem is 
trivial. We need only consider, therefore, the cases in which P, Q, R 
axe distinct. 

The point P = (x 0> ...,x n ) cannot lie in every face of the simplex. 
Suppose, for instance, that it does not lie in the face 
Then a; 0 + 0. A necessary and sufficient condition that P, Q, R are 
collinear is that PQ and PR meet the face A r ... A n in the same 
point. If PQ, PR meet this face in Q', R', then by the first part of 
the theorem, proved above, Q' is linearly dependent on P, Q and 
satisfies the equation y' 0 = 0, if we suppose that Q' = (y' 0 , ...,y' n ). 
Hence, by (1) above, 

Q ' = (0>y»~‘>y'n) = (0,2/i-Zi^b/o. ...,y n -x n Xo 1 y 0 ). 
Similarly, 

R' = (0,zi, ...X) = (0, Zj — ...,z n — z n XQ 1 z 0 ). 

Now, the matrix 

x o Vo z o\ /I 
Vx z i I \ ° 

. . ,l\o 

Vn z iJ 

is the product of the original matrix by a non-singular matrix, and 
therefore its rank is equal to that of the original matrix. By hypo- 
thesis, this rank is less than three. The columns of the first matrix 
are therefore linearly dependent. Since » 0 =# 0 this can only be the 
case if the submatrix 


is of rank one, that is, if Q' = R'. Therefore P, Q, R are collinear, 
and both parts of Theorem III are proved. 

Theorem IV. A necessary and sufficient condition that the points 

Pi = (*«>*!*> ■■■> X ni) (i = 0, 

be linearly dependent is that the rank of the matrix 







( 4 ) 


be less than r+ 1. 
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We have proved this theorem when r = 2, and we therefore 
assume that it is true for r points, and use the principle of induction. 

(1) Necessity. Suppose that the r+ 1 points are linearly depen- 
dent. If any set of r of these points is also linearly dependent 
then, by the hypothesis of induction, the corresponding columns 
of the matrix (4) are linearly dependent. The rank of this matrix 
must then be less than r+ 1. 

If no set of r points is linearly dependent, the first r points of the 
set of r + 1 points determine a space S r _ x which contains the re- 
maining point P r . In this S r _ x the points P»~',Pr-l define an $ r _ 2 
which is met by the line P 0 P r in a point R, say. By the induction 
hypothesis we may write, with the usual symbolic notation [V, § 4], 

A* 

<-i 


and also 


R = -P 0 \-P r A r . 


r 

Hence, £ i* A* = 0, and this means that the columns of the matrix 
o 

(4) are linearly dependent, and therefore the rank of (4) is less than 

r+1 - r 

(2) Sufficiency. Let the symbolic relation '^ l P i \ i = 0 express 

o 

the Unear dependence of the columns of the matrix (4). There must 
be at least two A f which are not zero. If there are only two, the 
corresponding P t coincide. The theorem is evident if any of the 
P i coincide. If more than two A* are not zero, say A 0 , A x , A 2 are not 


zero, then the points P 0 \ 0 + P^, — 2 -P»A f are coincident. Hence 

2 

the points Po>Pi and P 2 , ..., P r define intersecting spaces, and there- 
fore by § 1, (6), p. 210, the points P 0 ,Pi, •••,P r , are linearly dependent. 

Just as in Chapter V we deduce that the points of a fc-space S k 
are precisely those which satisfy n — k Unearly independent left- 
hand linear equations. 

We have now proved that an incidence space E n has the property 
that its points can be put in one-to-one correspondence with the 
points of a projective number space PN r n (K) in such a way that the 
points of E n lying in an S a correspond to the points of PN r n (K) 
which satisfy n — a Unearly independent left-hand Unear equations. 

Now let y = Ax, x = By (AB = I n+l ) 


be a projective transformation of PN r n (K). This gives us a new 



270 VI. PROJECTIVE space: synthetic definition 


correspondence between the points of Z n and the points (y 0 , ..., y n ) 
of PN r n (K). A necessary and sufficient condition for the collinearity 
of three points of Z n is still that the corresponding matrix of the y 
coordinates be of rank less than three, for the rank of a matrix is 
invariant under a non-singular transformation. 

Now let A\ (i = 0, n) be the points of Z n corresponding to 
those of PN r n (K) with coordinates (£ f0 , and let E' be the 

point in Z n which corresponds to (1, 1, 1) in PN?(K). Then we 
can construct a new reference system in Z n , using A' 0 , ...,A' n , E' in 
place of A 0 , ...,A n , E. Let P’ be any point of Z n which is not in 
any face of the new simplex, and let the corresponding point in 
PN r n (K) (more briefly, its y -coordinates) be (y 0 , ■■■,y n ). We show 
that we can construct a coordinate system for Z n , using the methods 
described above, in which the coordinates of P' are also (y 0 , ...,y n ). 

First of all it is evident that if the prime 


A’ A ' A ’ A' A' A' T>' 

71 0 ••• ••• ••• n -n r 

meets the line A\A) in P' i} , then the y-coordinates of P' i} are 
(0, ...,0,^,0, ...,y p 0, ...,0) = (0, ..., 1,0, 


There is, indeed, a one-to-one correspondence between the points 
of A^A'j (other than A)) and the field K, given by the y-coordinates 
of the points, in which A\ corresponds to 0, and E' i} corresponds to 1. 
Let us now consider the effect of the fundamental constructions on 
the points of A^A), choosing (A' it E' i}> A)) as the scale. By V, §7, 
we see that if the points Q, R of the line correspond respectively to 
the elements q, r of K, the points Q + R and Q • R correspond to the 
elements q + r and qr of K. Hence we can set up an isomorphism 
between the field determined by choosing the scale (A\, E'^A)) 
on A\ A) and the field K. 

Now, if the y-coordinates of any point Q' on A^A) are 

(0, ...,0,y,0, ..., 1, 0, ..., 0), 


let R' be the point on A^A^ with y-coordinates 


(0, .-.,0,q, 0, ...,0, ..., 1, ...,0). 

Then Q'R' meets A^A^ in the point F' jk with coordinates 

( 0 , ..., 0 , 0 , ..., 1 ,...,- 1 ,..., 0 ): 

From these results it follows that if we choose the scale ( A ’ 0 , E' ov A^) 
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on one line, AqA^ say, and construct our field as in §7, the iso- 
morphism between the field so constructed and the field K of the 
geometry can be set up so that the constructed coordinates of P' 0l 
are the same as its ^-coordinates. We may then use the trans- 
formations Tj k to assign coordinates to points on any edge A\A' m 
of the simplex of reference, and the constructed coordinates of these 
points are also the same as their y-coordinates. Hence the con- 
structed coordinates of P’ are the y-coordinates of P' for all 
points P ' . 

Our method of introducing coordinate systems into Z n allows us 
to establish a one-to-one correspondence between the points of E n 
and those of a right-hand projective number space PN r n (K), and, 
furthermore, as we have just seen, we can construct a new coordinate 
system corresponding to each projective transformation in PN^(K). 
But we cannot say, conversely, that every coordinate system ob- 
tained by the methods of this section necessarily arises from the 
original one by means of a projective transformation of PNn(K). 

In fact, let us consider the coordinate system (x 0 ,...,x n ) con- 
structed at the beginning of this section. This gives, of course, an 
isomorphism between the points P oi A 0 A 1 other than A x and the 
elements p of the field of the geometry, in which A 0 and E 01 corre- 
spond to 0 and 1 respectively. There may, however, exist other 
isomorphisms between the points of A 0 A x other than A 1 and the 
elements of K satisfying the same conditions, in which the point P 
corresponds to the element p of K, where p is not always the same 
as p. This isomorphism maps A 0 on the element 0, E 01 on the element 
1, the point P + Q on p + q, and P-Q on pq. If we now construct a 
coordinate system as above, using the T) k transformations, we may 
obtain a coordinate system for the incidence space which is not 
obtainable from PN r n (K) by a projective transformation. For 
example, if K is the field of complex numbers, and p denotes the 
conjugate complex of p, the coordinates of a point (x 0 , ...,x n ) are 
(x 0 ,...,x n ) in the new coordinate system. This transformation is 
not a projectivity, since it transforms the fine x z = x 3 = ... = x n = 0 
into itself and every real point of this line is a united point, whereas 
a projectivity on a fine has only a finite number of united points. 

Thus the space E n can be represented as a projective space 
P n {K), and our method of establishing coordinates in E n yields all 
the allowable coordinate systems of P n (K), but, possibly, other 
coordinate systems as well. 
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The reader may ask how, starting from the Propositions of 
Incidence, in which there is no mention of right or left, we arrive 
at a right-hand projective space, rather than a left-hand space. In 
particular, how is it that a left-hand projective space P l n (K), which 
satisfies the Propositions of Incidence, can be represented as a 
right-hand projective space? The answer lies in our choice of the 
definition of A‘B in §7. We might have interchanged the parts 
played by A and B in the construction, and so arrived at the skew- 
isomorph of the field of the geometry. Using this field, we should 
have obtained a representation of Z n as a left- hand projective 
space. 


9. Restrictions on the geometry. We have shown that a 
space defined by the Propositions of Incidence can be represented as 
a projective space, as defined in Chapter V, and have, in fact, shown 
that the definitions of that chapter are identical with those of our 
present chapter. For the further development of the properties of 
a projective space it is now convenient to impose certain further 
conditions on the space. The imposition of these conditions is 
equivalent to restrictions being imposed on the field of the 
geometry. 

A. The field K is to be commutative. There are several ways in 
which this condition can be expressed. By V, § 8, we see that an 
equivalent form of the restriction is to require that Pappus’ 
Theorem is true in the geometry. In our next section we shall discuss 
alternative forms of the same restriction. In the meantime we 
prove 

Theorem I. If Pappus' Theorem holds in an incidence space 
of two dimensions, then Desargues’ Theorem is a true theorem in that 
space. 

After the discussion in V, § 6 , it is sufficient to consider the ‘ general ’ 
case, in which the vertices of the two triangles ABC, A’B’C and 
the centre of perspective 0 are all distinct points. Let the inter- 
section of corresponding sides of ABC and A'B'C be P, Q, R. 

Let QR cut OB in P', and let AP' meet A'C' in L and OC in M . 
Also, let LB' cut AB in N. Now consider the two triads of collinear 
points OAA' and LB'N. By Pappus’ Theorem ON and A'L 
intersect on P'R = QR. Hence 0, Q, N are collinear. Next, con- 
sider the triads OMC, ABN. By Pappus’ Theorem MN and BC 
meet on QR. Finally, consider the triads OMC', LB'N. By Pappus’ 
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Theorem B'C' and MN meet on QR. Therefore BG and B'C pass 
through the point common to MN and QR. That is, Desargues’ 


ff 



Theorem is true. In place of VIII (a) (p. 216), which postulates the 
truth of Desargues’ Theorem for n = 2 (in the plane), we substitute 
a new postulate: 

IX. Pappus’ Theorem is true. 

B. The field K is to be without characteristic. Let (0, E, U) be a 
scale on a line l, and define the set of points E lt E^, ... (Fig. 40) by 
the relations Ef = Er _ x + E v E x = E. 



If the field is of characteristic p, the points 0,E V ...,E V _ X are all 
distinct, but E p = 0. 


HP I 


18 
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In particular, if p = 2, E t = 0, and therefore —E = E. 
That is, 

E = — E = (0, U)jE, (1) 

and we have the special case discussed in § 4, in which the diagonal 
points of a complete quadrilateral are collinear. 

Conversely, if we are in the special case the relation (1) holds, 
and E z = 0, so that the field K is of characteristic 2. We now add 
the postulate 

X. The points E, E z , E 3 ,... are all distinct. 

Finally, we sometimes wish to impose the condition that the 
field K be algebraically closed. The incidence space Z n can be repre- 
sented as a projective space P n (K). We know [III, §4, Th. II] 
that there is a unique field K which is the intersection of all 
algebraically closed fields containing K. We can embed P n {K) 
in P n (K) [V, §3]. Hence Z n can be embedded in an essentially 
unique space Z n whose field is K. We call £ n the algebraic 
closure of Z n . Our final postulate may then be stated as: The 
space E n coincides with its algebraic closure. More simply we shall 
usually say 

XI. The field of the geometry (or ground field) is algebraically 
closed. 

In the rest of this work we shall always assume Postulates I-X. 
We shall state at the beginning of each chapter whether Postulate XI 
is to be assumed. 


10. Consequences of assuming Pappus’ Theorem. It has 

already been remarked that there are several geometrical assump- 
tions which are equivalent to the assumption that the field of the 
geometry is commutative. Our choice fell on Pappus’ Theorem 
because it is possible to state it without a considerable amount of 
preliminary explanation. 

A more fundamental result, which is equivalent to the condition 
that the field is commutative (and hence to Pappus’ Theorem), 
concerns projectivities between ranges. We have seen [§ 3] that it 
is possible to establish a projectivity between two ranges so that 
three distinct points of the one range correspond to three distinct 
points of the other. On the other hand, if we are given four points 
A, B, C, D of one range, and four points A', B', C', D' of the other, 
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there may be no projectivity in which A, B, C, D correspond, 
respectively, to A', B', C', D'. We now prove 

Theorem I. A necessary and sufficient condition that a pro- 
jectivity between two ranges be uniquely determined by the assignment 
of three pairs of corresponding points is that Pappus' Theorem be true. 

(1) Necessity. Suppose that a projectivity is uniquely deter- 
mined when three pairs of corresponding points are given. 

Let ABC, A'B'C be two triads of points on two lines l, V which 
intersect at 0. Let B'C, BC' meet in P, CA’, C'A meet in Q, and 
A'B, AB' meet in R. 

(a) If the triads are in perspective from a point V then clearly 
the lines OP, OQ, OR all coincide with the harmonic conjugate of 
OV with respect to l and V . Then P, Q, R are collinear, and Pappus’ 
Theorem is true without any further condition. 

Suppose, next, that the line joining two of the points P, Q, R, 
say Q and R, passes through 0. Then if BB' meets A A' in V z , ov 3 
is the harmonic conjugate of OR with respect to l and V. If CC’ 
meets AA' in V 2 , OV 2 is the harmonic conjugate of OQ with respect 
to l and l'. Hence OV 3 = OV 2 , and therefore Vs = V 2 , the triads are 
in perspective, and P lies on QR. 

( b ) If the triads are not in perspective, we use the hypothesis 
that a projectivity is determined by the assignment of three corre- 
sponding pairs of points to deduce Pappus’ Theorem. Let QR 
meet l in L, and V in M’ . Then the points L, M’ , 0 are all distinct, 
by what we have just proved. Let AA' meet QR in S. We denote 
the line LM'QR by l*. 

By hypothesis there is a unique projectivity between the ranges 
on l and V in which A, B, C correspond, respectively, to A', B' , C'. 
This projectivity may therefore be defined by the perspectivities 

(A, B,C,...)^ ( S , R, Q , ...) ^ (A', B', C' , ...). 

Now, regarding 0 as a point of l, there is a unique point of V corre- 
sponding to it in the range on V. Passing from l to V via l* by the 
above perspectivities, 0 becomes M' , then M'. Similarly, the point 
L of l corresponds to 0 oil'. Since, by hypothesis, the projectivity 
is uniquely determined by the pairs A, A'; B,B' and C, C' , the 
points L, M ' are uniquely determined by the projectivity. The line 
l* is therefore uniquely determined by the projectivity, and this 
line contains the points Q, R. 
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Now, if we choose B, B' as centres of perspective, we find that the 
line BP cuts l in the point L and l' in the point M’ . That is, R and P 
also lie on l*. Hence, P, Q, R are collinear, and Pappus’ Theorem 



(2) Sufficiency. We now suppose that Pappus’ Theorem is true. 
Before proving our theorem we must prove a 

Lemma. If l, V are distinct lines intersecting in O, and if ranges 
(P, Q, R, O , ...), (P', Q' , R', 0, ...) on l and l' are projectively related 
so that 0 on l corresponds to 0 on V , then the ranges are in perspective. 

It is evident that if the ranges are in perspective 0 on l corre- 
sponds to O on l'. Now, we can pass from the range on l to that on 
V by two perspectivities at most [§3]. There is nothing to prove 
if we can pass from one range to the other by a single perspectivity. 
Let there be two, then, and denote the intermediary line by l*. If 
l* passes through 0, the ranges on l and V are in perspective [§ 3, 
Th. III]. We suppose, then, that l* meets l in P, V in Q' , and that 
P, Q', 0 are distinct points. We pass from the range on l to a range 
on l* by a perspectivity with vertex L, and then from the range on 
l* to that on V by a perspectivity, vertex N. It is easily verified that 
P and Q' do not correspond in the projectivity. If L = N the 
ranges on l, l' are in perspective, vertex L = N. If L=^N, the line 
LN must pass through 0, since 0 on l corresponds to 0 on V. Let 
LQ' meet l in Q, NP meet V in P', and let LQ' meet NP in M. If 
R is any other point of l, R' the corresponding point of the range 
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on V, LR and NR' must meet in a point R* of l*. Now, applying 
Pappus’ Theorem to the triads LON, PR*Q', we see that RR' 



passes through M. Since R, R' is any pair of corresponding points, 
it follows that the ranges are in perspective from M , and the lemma 
is proved. 

We now return to our main theorem. Let ( P,Q,R ,...), 
(P', Q' , R ' , ...) be ranges on two distinct lines l, V , and suppose that 
there are two projectivities TI 1 and 77 2 between the ranges which 



make P, Q, R correspond to P' , Q' , R' respectively. If a pair of 
corresponding points, say P and P' , coincide at the intersection of 
l and V, then, by the lemma, IT 1 and fl 2 are both perspectives with 
the intersection of QQ' and RR' as vertex of perspective. In thie 
case II 1 = I7 2 . Since the three pairs of corresponding points are 
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assumed to be distinct pairs we can select points on l and V, say Q 
and R' , which are both distinct from the possible intersection of l 
and l', and do not correspond. We can now pass from the range 
on l to the range on V by two perspectivities in which QR' is the 
intermediary line l* for both projectivities TT 1 and IJ 2 [§ 3, Th. IV]. 
Let the vertices of perspective for IJ 1 be 0 and O'. If X is any other 
point of l, X' the point on V corresponding to X in TI x , then 

(P, Q, R , X) ° (P*, Q*, R*, X *) ° x (P', Q\ R\ X\ 

where Q* = Q and R* — R' . 

Now let X be the point of l' corresponding to X in 77 2 , and let 
O'X meet l* in X*. Then 

(P*, Q*, R*, X*) & (P', Q', R', X) A (P, Q, R, X) (in i7 2 ). 

Since Q* = Q, the ranges ( P *, Q*, R*, X*), (P, Q, R, X), by the 
lemma, must be in perspective. The vertex must be 0. Hence 

0, X, X* are collinear points. That is, X* = X*, therefore X' = X, 
and hence TI X = 77 2 . 

If the ranges (P, Q, R, ...), (P', Q', R ', ...) are on the same line 

1, we project the second range on to a line l' , obtaining a range 
(P 1 ,Q V R 1> ...). If there were two distinct projectivities from 
(P,Q, R, ...) to (P', Q', R', ...) there would be two distinct pro- 
jectivities from (P,Q, R, ...) to (P v Q x , R x , ...), and we have just 
shown that this is not so. This concludes the proof of Theorem I. 

We now consider certain particular projectivities on a line l. 



In Construction I of § 5 for A + B let us keep A, O, U fixed, and 
also the lines a, u. This fixes P, and we suppose that Q is also fixed. 
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R is then fixed, and b is defined as QB. As B describes a range (5) 
on OU, S describes a range (S) on UR such that 

(S) ® (. B ). 

Similarly, (T) ^ (8), 

and therefore (T) A ( B ). 

If, in the usual way, we introduce coordinates on l, taking ( 0 , E, U) 
as our scale, and if A has the coordinate a, B the coordinate x, and 
T the coordinate x ' , then 

x' = x+a. (1) 

We have shown that a transformation of this type between the 
points of l is a projectivity. 

Next we consider Construction II, keeping O, E, U, A, P, Q and 



hence R fixed, and confine our attention to the case A #= O. We take 
QB as b. Then B, S, T describe ranges on l, OR, l respectively, such 
that 

(■ B ) ® (S) £ (T), 

so that ( B ) A ( T ). 

Defining coordinates by the same scale, if the coordinates of A, B,T 
are a, x, x' respectively, then 

x' = ax, (2) 

and this transformation is also a projectivity. 
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Finally, in the construction [§ 7], p. 257, for the inverse A* of A, 



Q 

Fig. 46. 


we keep 0, E, U, Q, R fixed. Then A*, 8, P, A describe ranges 
on l, OR, UQ, l, so that 

(A*) ® ( S ) % ( P ) * (A), 

and (4*) A (A). 

If the coordinates of A, A* are x, x' respectively, then 

xx' = 1, (3) 

and this transformation is a projectivity. 

These three projectivities are of fundamental importance. If we 
consider the transform of U under the projectivities, given respec- 
tively by (1), (2), (3), we see that under (1) U is transformed into 
itself; under (2) U is transformed into itself when a 4=0, and under 
(3) U is transformed into O. There is no element of K corresponding 
to U, but it is convenient to attach the symbol oo to U. The pro- 
perties of this symbol are 

(i) oo + a = oo, 

(ii) aoo = oo (a 4= 0), 

(iii) l/oo = 0, 

1/0 = oo. 

These are the usual properties of ‘infinity’ assumed in elementary 
algebra. 

We consider now, with reference to the coordinate system set up 
on l, the bilinear transformation 

, _ ax + b 
X ~ cx + d ’ 


( 4 ) 
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where a, b, c, d are in K, and are not all zero. If ad = be this trans- 
formation reduces to x' = constant, where the constant is in K , or 
is oo. Then x' does not depend on x, and we exclude this case. We 
now consider two cases. 

(i) c = 0. Neither a nor d can be zero, and 

. a b 

x - j x + j- 

d d 

Writing x l = axjd, this becomes 

x' = x 1 + b/d, x x = ax/d. 

We can therefore pass from a; to a:' by a transformation of type (2), 
which takes x into x v followed by a transformation of type (1), 
which takes x 1 into x'. Since both transformations define pro- 
jectivities, it is clear that (4) defines a projectivity when c = 0, 
ad 0. 

(ii) c#= 0, ad + bc. We can pass from x to x' by the series of 
transformations 

, 1 (be — ad) , a 

x l = cx, x 2 = x 4 + d, x z — — , x 4 = -x z , x —x 4 + -. 

C C 

These are of types (2), (1), (3), (2), (1) respectively. Since each 
transformation represents a projectivity it follows that (4) repre- 
sents a projectivity. Hence we have 

Theorem II. A transformation given in a coordinate system on a 
line l by the equation 

, _ ax + b 
X cx + d * 


where a, b, c, d are in K and ad 4= be, is a projectivity. 

We now show that every projectivity on l can indeed be repre- 
sented by an equation of the form given in the above theorem. We 
first see what happens to the point x — oo under the five projecti- 
vities into which x' = (ax + b)j{cx+d) has been resolved. The first 
two leave it invariant, the third transforms it into 0, the fourth 
leaves 0 invariant, and the fifth transforms it into a/c. The point 
oo is therefore transformed into the point x' — a/c. Hence we 
derive a further property of the symbol oo: 


ax + b a 
cx+d c’ 


when 


X = oo. 
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Now let us suppose that a given projectivity transforms the points 
0, 1, oo into the points p, q, r respectively. The transformation 

r(q-p)x+p(r-q) 

* = (*-,)*+<,-,) (5) 

clearly transforms 0 into p, 1 into q, and, by virtue of the relation 
above, it also transforms oo into r. This transformation is also, by 
Theorem I, a projectivity, since 

ad-bc = (q-p)(r-q)(r-p), 

which is not zero if p, q, r are all distinct. But a projectivity is 
uniquely defined if we are given three pairs of corresponding points. 
The transformation (5) therefore represents the given projectivity, 
and we have proved 

Theorem III. Any projectivity on l is given by an equation 

ax + b 

X ~ cx + d’ 

where a, b, c, d are in K, and ad 4= be. 

As an immediate corollary to this theorem we see that the pro- 
jectivity in which (x v x'J, (x 2 , x 2 ), (x 3 , x' 3 ) are corresponding pairs 
is given by 

(a-Si) (x 2 —x 3 ) = (#'— a^) (x 2 -x’ 3 ) 

(x-x 3 )(x 2 -x 1 ) (x' — x' 3 ) (x'z — Xi)' 


Let us now consider a change of coordinate system on l. Let x 
be the coordinate of a variable point in the original coordinate 
system for which (0, E, U) is a scale, so that these points are given 
by x = 0, l,oo respectively. Let (O', E', U') be a new scale on l. 
This does not of itself fix a new coordinate system, since we still 
have to set up an isomorphism between the points on l other than 
U' and the elements of the ground field K so that O', E' correspond 
to 0, 1 respectively. There may be several ways of doing this. For 
example, if K is the field of complex numbers and we replace every 
element of K by the conjugate complex element we satisfy all the 
conditions and obtain a new coordinate system. But we can deter- 
mine a particular isomorphism as follows. There exists a unique 
projectivity operating on the points of l which takes O , E, U into 
O', E', U' respectively. If this takes P into P' we associate with P' 
the element x which is the coordinate of P in the original coordinate 
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system. This correspondence between the points of l and the 
elements of K is clearly an isomorphism, and the coordinates 0, 1 
are associated with O', E' respectively. 

We now find the coordinate y of a point X in the new coordinate 
system in terms of its coordinate x in the original system. If Y is 
the unique point of l such that 

(0,E, U, Y) A (O’, E', U',X), 

then y is the coordinate of Y in the original coordinate system. 
Since the correspondence between X and 7 is a projectivity, 

ax + b 

by Theorem III, where a, b, c, d are in K, and ad 4= be. 

Next, consider a projectivity between two distinct lines l and V, 
such that 

(O, E, U,P,...) A (0',E', U',P',...). 

If a coordinate system exists on l for which ( 0,E,U)is& scale, then 
the projectivity between l and V sets up a coordinate system for 
which (O' , E' , U') is a scale, corresponding points of the two ranges 
having equal coordinates. The projectivity is given by 

x' = x 

in the two coordinate systems chosen. 

We have stressed the point that the selection of a scale (O, E, U) 
on a line does not, of itself, fix the coordinate system. But, having 
determined a coordinate system on any line l with respect to the 
scale (0, E, U), we can determine a unique coordinate system on 
any line l' with respect to any given scale (O', E' , U') on that line, 
by using the unique projectivity which makes (O, E, U ) correspond 
to (O', E', U') respectively, as above. The coordinate systems on l 
and l' related in this way may be described as related coordinate 
systems. 

Let us fix, once and for all, a coordinate system on a definite line 
l of the given space. For brevity we shall call any coordinate system 
related to this chosen one on l & related system. If A 0 , ...,A n , E is 
a reference system, and we use the related coordinate system on 
A i A j for which the scale is (A it E if , Aj), we can introduce coordinates 
into the space as in §8. We call this coordinate system a scale 
coordinate system of the space. 
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Now let m be any line of the space. It cannot meet all the S n _ 2 

spaces A. ^ A <bj , spanned by vertices of the simplex of reference, 

and we may suppose that it does not meet that determined by 
A 2 , A n . Then if it meets x 0 = 0 in P, x x = 0 in Q, the points 
P, Q are distinct. By suitably choosing the multipliers, we may take 

P = (®> If Pji 

and Q = ( l,0,q 2 , ...,q n ). 

Let R = (1,1, p 2 +q 2 , —»!>»+?»). 

This point is on PQ and in the S n _ x determined by A 2 , ..., A n) E. 

Now let v , . 

X = (x 0 , ...,x n ) 

be any point of m other than P. Then x 0 4= 0, and we can write 


X = Q + Px x x o 1 = Q + Pi. 

We now have a one-to-one correspondence between the points X 
of PQ other than P and the elements i of K. We show that this 
correspondence is that determined by the related coordinate system on 
m with scale ( Q , R, P). 

Project the range on m from the S n _ 2 formed by A 2 ,...,A n on 
to the line A g A v The points Q, R, P project, respectively, into 
A 0 , E ol , A v and X projects into the point (1, i, 0, ..., 0), which has 
i as its scale coordinate. This proves the result. 

Now let L, M, N be any three distinct points on m, and choose 
the multipliers of their coordinates so that 

M = L+N. 


Then, if Q = La+Ny 

and P = L/S+NS, 

then aS — fly #= 0, 


since P, Q are distinct. Also, the point X on m can be represented 


by the symbol 


X = L + Nrj, 


where 


V = 


y+Si 

a+/?r 


Since this last relation is of type (4), it follows that the correspondence 
between the points X of m and the elements y of K (and oo) is that defined 
by the related coordinate system on mfor which the scale is ( L , M, N). 
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Now carry out an allowable transformation of coordinates 
x' = Ax (A = (a^)). 

If, in the original coordinate system, 

^ = 

N = (n 0> ...,n n ), 
and X = (x 0 ,...,x n ), 

where x { = li + n^ ( i = 0, ...,n), 

the new coordinates of L, N, X are 
L = (l 0 , ...,l n ) t 
N = 

X = ( x' 0 x' n ), 

n 

where l\ = "Eocylp 

i “0 
n 

n'i = ^CCyUp 
i - o 

n 

x’i = 

i-o 

and therefore 

x'i^li + n'iTi (i = 0, 

Hence, the correspondence between the points I of m and the 
elements ij of K (and oo) is unaltered by an allowable transformation 
of coordinates. Since this result is true for every line of the space, it 
follows that an allowable transformation of the coordinate system 
merely changes the simplex of reference used to construct the scale 
coordinate system. Finally, since a scale coordinate system is 
uniquely defined when we are given the reference system A 0 , .. .,A n ,E, 
and an allowable transformation is uniquely defined (save for 
a common multiplier) when the points with new coordinates 
(1,0, ...,0), (0,1,. ..,0), ..., (0,0,..., 1), (1,1,. ..,1) are given, it 
follows that the set of transformations of coordinate systems from one 
related system to another coincides with the set of allowable trans- 
formations of coordinates. 



CHAPTER VH 


GRASSMANN COORDINATES 

We now return to the algebraic methods used in Chapter V. From 
now on we shall assume that the ground field K is commutative, and 
without characteristic. Indeed, the main reason for considering 
more general fields in Chapter V was that this cleared the way for 
the detailed analysis of the Propositions of Incidence which was 
made in Chapter VI. It is not necessary to assume in this chapter 
that K is algebraically closed, since all the algebraic operations 
which we shall use are rational. 

1. Linear spaces. In V, § 4, we defined a linear space of k dimen- 
sions in PJj(A), and in the subsequent sections we obtained certain 
elementary properties of linear spaces. We indicated how similar 
definitions and properties could be obtained for P l n (K), and it was 
clear that the two sets of definitions and properties coincide when 
K is commutative. In this chapter it will not be necessary to 
indicate the ground field explicitly. We shall denote the space with 
which we are concerned by [»]. Linear spaces in it will be denoted 
by S a , 8 b , ..., the suffix denoting the dimension. 

We recall that if 

A 1 = (aj, ...,<4) (i = 0, ...,k) 

in a given allowable coordinate system, the points A 0 , ...,A k are 
linearly dependent if and only if the matrix 



is of rank less than fc+l. By definition, any point (a 0 , ...,a n ) is 
linearly dependent on A 0 , ..., A k if and only if there exist elements 
A„, ...,A fc of K such that 

k 

^ = SA i a| (i = 0, ...,n). 

j =o 

We saw also that in a linear space of k dimensions there exist sets 
of k + 1 independent points, but no set of h independent points if 
h> k+ 1. 
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Now let A 0 , ...,A k be a basis for S k . The points B°, ...,B k of S k 
can be expressed in the form 

tf-SAU* ( i = 0,...,k ). 

1 - o 


As we saw, the set of points B°, B k forms a basis for S k if and 

A = (Aj) 

and B = (/?}), 


only if the matrix 
is non-singular. If 


the space S k spanned by A 0 , ..., A k is determined, in an obvious 
sense, by the matrix A, or, when A is non-singular, by the matrix 

B = AA. 


Thus S k is determined by a set of equivalent (k + 1 ) x (n+ 1 ) matrices 
A, B, ..., equivalent matrices being such that we can pass from any 
one to any other by premultiplication by a non-singular square 
matrix. 

Let us consider an allowable transformation of the coordinate 
system n 

x* = 2 T tj x f = 

l-o 

or x* = Px, 

in matrix notation. If, in the new coordinate system, 

A i = (a£*, 

n 

then af l = 2 r jk o 4, 

k =0 

and hence in the new coordinate system S k is determined by the 
set of matrices equivalent to 

A* = (af*) = AP'. (1) 

Thus in \n\ a linear space S k is determined by a set of 
(k + 1) x (n + 1) matrices which are equivalent (in the sense described) 
in a given allowable coordinate system. The law of transformation 
for allowable transformations of coordinates is given by (1). How- 
ever, for many purposes the identification of a space by means of 
a set of equivalent matrices is not the most convenient method, and 
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it is our purpose in this chapter to consider a more suitable system 
of coordinates for the ^-spaces of [to]. These coordinates were first 
introduced by Grassmann, and they coincide, when k = 0, with the 
coordinates already defined. 

2. Grassmann coordinates. We choose an allowable coordinate 
system and keep it fixed. Then a given S k of [to] is determined by a 
set of equivalent (&+ 1) x (to+ 1) matrices A,B,... which are all, 
of course, of rank k + 1. Suppose that 

B = / IA, 

where A is a non-singular (k+ 1) x (k+ 1) matrix. We consider the 
(k+ l) x (k+ 1) submatrix of A formed by selecting the columns 
with suffixes i 0 ,i v ...,i k> and the corresponding submatrix of B. 
We have 

m o . =a 

Mo • au 

vi • kJ 

and taking determinants of both sides, we find that 

fit. ■ fit, -m 

Pi ■ pi, 

where | A | = detA is different from zero, and does not depend on 
the suffixes i 0 , ...,i k chosen. We write this equation in the form 

= M I a i 0 ...ilc‘ 

Since A is of rank k + 1, for at least one set of suffixes i 0 , 

and then b io it 4= 0. 

Again, from the elementary properties of determinants, 

if the integers i 0 , are not all distinct; and if j 0 , ...,j k is any 
derangement of the set i 0 , . . ., i k , 

a i 0 ...ik = ea U -Jk> 

where e = + 1 or — 1 according as the derangement is even or odd. 
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From these considerations it is clear that the set of equivalent 
matrices which determines S k also determines a set of 

equivalent ^ jj -tuples (...,a io __ Ak , ...), (..., ...)• We define 

the 0ra8smann coordinates of S k to be any one of the equivalent 

(& + l j -tuples. Hitherto, in using the term coordinates in connection 

with points we have implied (i) that each point has a set of co- 
ordinates which is unique except for a common multiplier, and 
(ii) that any (»+ l)-tuple gives the coordinates of some point. This 
property (ii) is not characteristic of Grassmann coordinates, and we 

shall see later that except when k = Oork = n — lan jj -tuple 

•••) can onl y gi ye the coordinates of an S k when the 
a i# ik satisfy certain algebraic relations. The property (i) of 
coordinates described above is, however, an essential one, and, 
before proceeding, it is of fundamental importance to show that 
no two distinct spaces S k , S' k can have the same coordinates. We do 
this by showing how to write down the equations of n — k linearly 
independent primes through a given S k> and remark that these 
equations depend only on the coordinates of S k . 

Suppose that S k is determined by the basis 

A i = (o4,...,<4) (i = 0, ...,k), 

and let the Grassmann coordinates of 8 k be ...). One at 

least of these coordinates is not zero. Suppose that 


the matrix 


Pi 0. 

.JlC 

+ 0. 

)oint 

(X 0 ,..., 

/<A 

• 

a?/ 

u 

• 

• 


Vo 

• 

n, 


should be of rank 1. In this case the determinant of the sub- 
matrix formed from the columns with suffixes i 0 , ...,i k+l must be 
zero. Expanding this determinant in terms of the last row, we get 
the equations 

u+i 
h - 0 


HP I 


19 



290 


VH. GRASSMANN COORDINATES 


These equations, for all possible choices of i 0 ,...,i k+1 , are either 
identically true, or define primes which contain all points of S k . 

Hence the aggregate of these ^ + 2 ) e< l ua ^ ons defines a linear 

space which contains S k . In order to show that these equations 
actually define S k itself it is therefore sufficient to show that 
amongst them there is a set of n — k linearly independent equations. 
We consider the equation of the set for which 

(V •••»*£+ 1 ) = (jo> •••>Jk’j)> 

where j is any integer 0 < j < n distinct from j 0 , . ..,j k . This equation is 

2 ( “ 1 ) hx j h Pi*...)h-iJh + i -hi + ( - 1 )* +1 x jPio...jk = °- 

h = 0 

Since each Grassmann coordinate is skew-symmetric in its suffixes 
we may write the equation in the form 

2( ~ 1 ) h x jh { - 1 ) k ~ h p u ... ... + ( “ 1 ) fe+1 Wu-h = 

/l** 0 

k 

or = °- ( 2 ) 

h— 0 

If is a derangement of (0,...,w), we consider 

the equations (2) for the values of j given by 


J Jk+ 1> 

The matrix of coefficients of these equations is an (n — k)x(n+ 1) 
matrix. It is of rank n — k, for the determinant of the submatrix 
corresponding to the columns with suffixes j k+v ..., j n is equal to 
(Pj 0 ...j k ) n ~ k , and this is not zero. It follows that the set of equations 
(2) defines n — k linearly independent primes, and therefore 
defines S k . 

The equations of S k are therefore determined by the Grassmann 
coordinates of S k , and it follows that no two distinct ^-spaces can 
have the same coordinates. 

We conclude this section by considering the effect of an allowable 
transformation of coordinates in [ri\ on the Grassmann coordinates 
of S k . We saw in § 1 that the transformation 

x* = Px (3) 

transforms the matrix A which defines S k into the matrix 

A* = AP'. 
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Hence 


Pi,. ..Ik 


*0 


a' 




at 0 

.ik 


>y ^ k 

a ik 


& 7i 

• • • 

ETf #ft a^ . Sr iiA a^ 
/& A 


= s 

h 0 ,...,hk 


'io h 0 


iohjc 


f ikK 


'ijchjc 


by II, § 8, Th. V, and this is equal to 


„o 


a A 0 


s 

A-o, •..,hf c 


' io ho 


T io hk 


T ikho 

T ikhk 


Ph 0 ...hf 


<k 




(4) 


Hence an allowable transformation of coordinates in [n] induces 
a linear transformation of the Grassmann coordinates of a fe-space. 
The matrix of this transformation has as its elements the minors 
of order k + 1 which can be extracted from P. Let all minors which 
come from the same set of k + 1 rows (or columns) of P be placed in 
the same row (or column), and let the priority of elements in rows 
or columns be decided on the principle by which words are ordered 
in a dictionary; that is, let the order be lexicographical order. The 




matrix is called the (k+ l)th compound 


of P, and it is denoted by P fc+1) . From the reasoning given above 
it follows easily that 


(PQfk+l) = p(fc+l)Q(fc+l>. (5) 


If P = I n+1 (the unit matrix), evidently 

P fc+1) = ln + u 

\k+l)‘ 

Hence, taking Q = P -1 , we see from (5) that 

/(»+!) = (4+i) (fe+1) = ^ fe+1) G (fc+1) , 


and therefore the matrix 
is the inverse of P fc+1) . 


(p-i^(fe+D 


19-2 
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These compound matrices are of importance later on, when we 
consider them as defining a particular type of allowable trans- 
formation in an associated iV-space, where N+l = 



3. Dual Grassmann coordinates. An S k may be defined not 
only by k + 1 linearly independent points which lie in it, but also 
by means oin — lc linearly independent primes which pass through it. 
In a given allowable coordinate system let S k have Grassmann 
coordinates . . 

n 

and let 'Zufaj = 0 (i = 1 , — k) (1) 

j =o 


be the equations oin — k linearly independent primes which contain 


S k . If 


U = («|), 


the equations (1) may be written in matrix form as 


Ux — 0. 


( 1 )' 


These equations are the equations of S k , and the (n — k)x (n+l) 
matrix U is of rank n — k, since the primes are linearly independent. 
Any prime through S k has for its equation a linear combination of 
the equations (1), and n — k such primes, given by the matrix 
equation „ _ 


are linearly independent primes through S k if and only if there 
exists a non-singular (n — k)x(n — k) matrix M such that 

V = MU 

that is, in the terminology of § 2, if and only if U and V are equivalent 
matrices. Thus 8 k can also be determined by a set of equivalent 
(n—k) x (n+l) matrices. 

If, as in § 2, we consider the transformation of coordinates 

x* = Px, (2) 

the equations of S k become 

U P -1 x* — 0, 

and the set of equivalent matrices U, V, ... becomes the set of equi- 
valent matrices UP~ X , FP -1 
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Again, the determinants of the (n—k)x(n— k) submatrices of 
the set of equivalent matrices U, V, ... determine a class of equi- 
valent -tuples. A representative member of the class can be 

determined from the matrix U. We write 


, 1 * 1 ... i n -k = 


p *1 — 


< • <-* 


U 


n—k 


U 


n—k 

in-k 


( 3 ) 


Then S k is determined by the set of ^ + -tuples 


We call these dual Orassmann coordinates. 

It is easy to show that no two distinct spaces S k , S k give rise to 
the same set of dual coordinates. If (x 0 ,...,x n ) is any point of 
S k , then 

n—k n 

2 2 AX** = 0, 

i- U«0 

where \ v A n _ fc are any elements of K. Now choose 


& 

II 

T 

< • 



■ 

u U-k-x 


< +1 • 

U i+1 

' U U-k- 1 


uf~ k 

n.n—k 

■ u U-k-i 


and we obtain the equation 

2 ph-ln-k-ilxj = 0. (4) 

i - o 

If we give^, . ..,j n _ k _ t all possible values, (4) gives us a set of equa- 
tions defining a linear space which contains S k . As in the previous 
section we now show that there are n — k linearly independent 
equations in the set, and deduce that the linear space coincides 
with S k . 
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Since U is of rank n — k, there is at least one non-zero dual 
coordinate. Suppose that 

p<*i-On-k^0 9 

and consider then — k equations obtained from (4)when ...,j n -k-i 

are selected from the set a x , ..., a n _ k . If is the element omitted in 
the selection, (4) becomes 

n 

2 p a i"‘ a i-i a i+i'" a n-k1 Xj = 0 , 

i-0 

that is, 

p a l...Oi-x<H+i'" a n-k a ix ai -\- 2 p°l ••• a t-i a i+l — a n-ki Xj = 0, 

an-k 

Or p a l — a ”~kX ai + 2 p a l — a i-d a i+l — a n-kXj = 0. 

3+0, On-* 

If we take i = 1 , . . . , to — k, we obtain n — k linearly independent equa- 
tions, since the determinant of the submatrix of the (to — k) x (n + 1) 
matrix of coefficients which corresponds to the columns a v a n _ k 
is evidently 

(pax... an-^n-k' 

and this is not zero. The dual coordinates of S k therefore determine 
it uniquely. 

These dual coordinates are closely related to the ordinary Grass- 
mann coordinates defined in § 2. We now prove 

Theorem I. Let (i 0 , . . . , i n ) be any derangement of (0, ...,n). Then 
if 8 k has Orassmann coordinates (...,p,- o fi , ...), and dual Grassmann 
coordinates (...,p**+i •••*», ...), there exists a non-zero element p in K 
such that 

qu...i k =%...i n P i,c+1 ~ in =PPio...ik- ( 6 ) 

This equation defines q io __ iJe . The proof is in two stages. We show, 
first of all, that the transformation (2) of coordinates in [to] induces 
a non-singular linear transformation in the coordinates of 

the same kind as that induced in the coordinates ^ [cf. §2], 
We then show that P can be chosen so that in the new coordinate 
system it is evident that 

<...<*= PPL..** 

for some p. The theorem then follows from the fact that the co- 
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ordinates •••) and ...) are transformed in the 

same way for each allowable transformation in \n\. 

Since the transformation (2) transforms the equations of S k into 


if 

then 


XJP-^x* — 0, 

P = (T tj ), and P" 1 = (o-y), 


••• in-k 


1 


'ZUj- k °'Hn-* 


= S 

ili •••tin—k 


U 


U. 




n—k 

ii 


U: 


= ^ ph*”1n-k 

Jit •••tin-k 


in-k 


«£f 


® 1n-kH 


VjlU 
• • 

^hin-k 

^In-kin-k 


^hin-k 
1 T Jn-kin-k 


Hence 


C..ii = %...i n P* ik+1 '" in 





• °V*+i*n 

• • 


• °v»<» 

Q’jlc+itk+i 

• ^ik+iin 

Vjnik+i 

• °V»<n 




^Jk+lik+l 

^inik+i 


<» 


the summation being over all the distinct partitions 

0 # 0> •••yjk) (jk+l> •••yjn) 0 ^ Oo> se ^ s (^0 Oo> 

are, of course, derangements of (0, 
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But [II, § 8, Th. VIII], 


^Jk+lii+l 

* V’jk+iU 

• • 

II 

gs. 

i • 

.L 


* T ikio 

°Vni*+i 

* O'jnin 


T *oik 

’ T ikik 


where I P I = det P. Hence 


0 •» •••» Jk 


T io1o 

T ioii 


T ikio 

T ikik 


Qio — ik' 


( 6 ) 


Comparing this with the result of § 2, (4) we see that the allowable 
transformation of coordinates in [n] given by (2) induces the same 
linear transformation on the coordinates (...,& i]c , ...) as it does 
on the coordinates (•••> •••)> save f° r a common non-zero 

factor. 

Now, by V, §4, Th. V, there is an allowable coordinate system 
in [»] in which the points A 0 , ...,A k , which form a basis for S k , 
coincide with vertices X* 0 , . . . , X* k of the simplex of reference of an 
allowable coordinate system. Let us suppose that the transforma- 
tion (2) is to this coordinate system. Then S k is now given by the 
(&+l)x(w + l) matrix 



1 



9 


and therefore p* k — 1, but p*...i k = 0, unless the set (i 0 , ...,i k ) is 
a derangement of (0, ..., k). Also, S k is given by the equations 

x { — 0 (i = k + 1 n), 

that is, by the (n — k) x (n+ 1) matrix 


r • 

i 

0 

• 

•> 


0 

1 

• 

• 

1 : 

0 

• 

• 

• 

• 

• 

1 ) 

p*k+l.. 

.71 — 

1, 

but 

< 


(i 0 , ...,**) is not a derangement of (0, ...,k). In this special system 

of coordinates * * 

?*....** = 
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and therefore, as explained above, we conclude that in any allow- 
able coordinate system the relations (5) hold for a suitable p. 

The Principle of Duality [V, §5] throws further light on the 
relation between the Grassmann coordinates of an 8 k and its dual 
Grassmann coordinates. If the points 

A* = (< 4 ,...,< 4 ) (i m 0,...,k) 

form a basis for S k , the equations 

n 

Ytcdxi = 0 (i — 0 , ...,&) 

i - o 

define the dual space Z n _ k _ 1 . Hence, if 

are the Grassmann and dual Grassmann coordinates respectively 
of S k , and 

(•••» ?< 0 . and 

are the Grassmann and dual Grassmann coordinates of £ n - k -i, 
then evidently , , 

qH-H = pp i0 ' ik , 

and therefore, by our theorem, 

P ix '"* n ~ k = Wh.-.U-k' 

Using these results it is possible to read off" a property of Grassmann 
coordinates dual to any known property. For instance, it follows 
at once from the fact that the equations of S k are [§ 3, (4)] 

n 

2 plv“1n-k-x1 x* = 0 , 

i-0 

that if the vector 

iPh: • in-*- 1 0» Pit ... in-*- Ph ... in-*-in) 

is not zero, it gives us the coordinates of a point in the 
whose Grassmann coordinates are (...,^ i <n t , ...). By giving the 
set of integers (j v ...,j n -ie-i) all possible values we obtain a set oi 
points in which contains a basis for this space. 

4. Elementary properties of Grassmann coordinates. The 

condition _ 

Pa,. ..a* = °» 

for a given set of suffixes (o 0 , ...,a k ), is easily interpreted as a 
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geometric property of the S k which has the Grassmann coordinates 
For if 

= (a 0 , *«•) cc'f) x 0| •*(} A?) 
is a basis for S k , the condition is 


a 


a 0 


a 


a* 


0. 


a; 


a 0 


a 


fc 


From this statement of the condition we deduce the existence of 
elements A 0 , ..., of K y not all zero, such that 


^ 0* — •••» 

i-0 




fc 

It follows that 2 ^.4* is a point in S k whose coordinates satisfy 

t-0 


the conditions 


^Oo = *«, 




a k 


= 0. 


These last equations define an 8 n _ k _ ly a basis for which consists 
of n — k vertices of the simplex of references. The condition we are 
examining is a sufficient condition for S k to meet this S n _ k _ v Con- 
versely, if S k meets this S n _ k _ l9 one of the points, say A k y of a basis 
for S k can be chosen in S n _ k _ v and then 

* 5 , = — = aa k = 0 , 

and therefore p a<t _ ak = 0. 

This result can be generalised: 

Theorem I. Let a 0 ,...,a s be 5 + 1 distinct integers (« + 1 < k + 1 ) 

chosen from the set 0 n; and let S n _ s _ x be the linear space whose 

equations are 

*o. = ••• = = °* 


The necessary and sufficient conditions that S k should meet $ n _ g _j 
in a space of dimension at least k—s are 


Pat,...a a i a +i...ii ~ ® 

for dll possible choices of i s+1 , . ..,i k . 

Clearly we need only consider sets (i g+1 , ...,i k ) which consist of 
distinct numbers, none of which is equal to any member of the set 
(a 0 , ...,a g ). After what has been proved above we may also suppose 
that 8 < k. 
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Now let us assume that S k and 8 n _ ^ have a linear space of at 
least k— a dimensions in common. Then we can find k—s+1 
linearly independent points of S k in S n _ s _ v and we may include 
these in a basis for 8 k , say as the points A s , A s+1 , ..., A k . Then 

aj = 0 (i = s, .. .,k;j = a 0 ,...,a g ). 

It follows that in the matrix A = (aj) which defines 8 k the columns 
corresponding to the suffixes a 0 , ...,o g have non-zero elements only 
in the first s rows. The rank of the submatrix formed by these 
columns is therefore not greater than s, and hence these columns, 
considered as vectors, are linearly dependent. Therefore the 
columns corresponding to suffixes (o 0 , ...,a g , i g+1 , ...,i k ) are also 
linearly dependent, and finally 


Pa 0 ...a,i >+1 ...lk ~ 

This proves the necessity of the condition. To prove its sufficiency 
we recall that any S k intersects an 8 n „ t in a linear space of at least 
k — s — 1 dimensions. Hence we can choose the points A s+1 , ...,A k 
to lie in 8 n _ e _ lf so that 

ocj = 0 (i = s+1, ...,k;j = a 0 , ...,a a ). 


Then evidently 




a 




a 


a s 


a; 


a 0 


a; 


a s 


a: 


s+l 




using elementary properties of determinants. 
Not all the determinants 


a: 


, 5+1 


a: 


ik 


a: 


. 5+1 


a- 


s+l 


ot k 

a »*+i 


a 




can be zero. If they were, the last k—s rows of A would be linearly 
dependent. Since k — s>0, this would imply that the rank of A 
was less than k+l, contrary to hypothesis. Hence the equations 


imply that 


Pa*... a s i a+l ...lk ~~ ® 


a 


<h 


a 




= 0. 


cc% 


cC 
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It follows that there exist elements A 0 , ..., A fc of K, whioh are not 

8 

all zero, such that the point £ A iA l , which is a point of 8 k linearly 

i-0 

independent of A a+1 , ...,A k , lies in S n _ a _ v Hence 8 k and <S n _*_i 
have at least k—s + 1 linearly independent points in common, and 
therefore intersect in a space of at least k—s dimensions. This 
completes the proof of the theorem. 

We now show how to determine a basis for an S k when we are 
given its Grassmann coordinates •••)• Let be a 

non-zero coordinate, and define 

= ai +l ...ait (* = 0, ...,&, ^ = 0, ..., ?l). (1) 

Since <• = Pa 9 ...a k *0, 

the (n + 1 )-tuple B i — (x\, ...,x i n ) 

represents a point, which we denote by B i . Since 

%J — 0 (j = O 0 , ••• ><*<_!> •••><*&)> 

the point B i lies in the S n _ k whose equations are 


X) = 0 (j = a 0 , ...,a { _ v a i+1 , ...,a k ). (2) 


We show that J5 f is the unique point of intersection of S k with this 
S n _ k . Since 


vE. 

II 

a 0 

oft 

• 1 

a i 

oft 

a «i+ 1 

oft 



oft 

fift 

• 1 


oft 

1 

oft 

* 


h 

k 

S(- 

-0 


oft 

^a 0 

• 

oft 

a aj_i 


/yO 

» • 




oft-1 

a a 0 

• 

a' 1 ”" 1 

a a<-i 

a' 1 ” 1 

a *-i 

> ®a* 




< +i 

• 

u a i - 1 

a *+l 

. < +1 


1 . 


< 

• 

oft 

oft 

• < 


= ZA£o$, 

fc -0 


it follows that B l = 2 is a point of S k . If S k meets the S n _ k 


ft = o 


given by (2) in more than the point B 1 , it has at least a line in 
common with it, and this line, lying in 8 n _ k , must meet the $ n _ fc _ x 

« ivenby *, = o y = 
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This would then meet S k , and by Theorem I we should have 

Pa a ...a k = 

contrary to hypothesis. It follows that B i is the unique point 
common to S k and the S n _ k given by (2). 

We now show that the points B°, B k defined in this way form 
a basis for S k . Since each B i lies in 8 k it is sufficient to prove that the 
k + 1 points are linearly independent; that is, that the matrix 

X= /x° 0 . x° n 

\ /y 


is of rank & + 1. The submatrix of X whose columns correspond to 
the suffixes a 0 , ...,a k is 

( Pao...a k 0 . 0 

® Pa a ...a k • 0 

• • • • 

® * • Pa 0 ...a k 

(since x i ai =p a and this submatrix is non-singular, since 
Pa 0 ...a k * 0 - Hence X is of rank &+1, and the points B°,...,B k 
form a basis for S k . 

An immediate consequence of this result is the fact that there 
exists a non-zero element pinK such that 


PPio — ik 


X, 


u 


x. 


i k 


I 4 • 4 I 

for all choices of (i 0 , ...,i k ). If we take 

this equation becomes 

PPaf,...a k ~ (Pa 0 ...a k ) k+ *> 
and therefore p = (p a<) ...a k ) k - 

Again, suppose that ij = a t , except when j — t,u,v, there 

being s elements in this last set. Then 


ijg 

< 

• 4 

x o 

• X lu 

4 

. < 


< 

. 4 

r*JC 

4 

'I? 

• 
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In any column corresponding to the suffix a t every element other 
than x^ is zero. Since a determinant can be expanded in terms of 
the elements of such a column, and since x ^ = p a<t ... at is on the 
principal diagonal, 


that is, 


PPi 0 ... it (Pa 0 ...ak)* *"*"* 


(Pa 9 ...at) g l Pi a ...ii 


4 . 


a» 


/j A, 

*i w 




4 . . 4 - 


( 3 ) 


XI 


T w 

X i» 


If we dualise these results we can obtain a basis for the primes 
which pass through S k . But we may also use the results of § 2. If 
(a 0 , . . . , a n ) is an even derangement of (0, 


pa k+1 ...a n = Pao ^*0, 


by hypothesis. By § 2, (2) the equations 

k 

Pa 0 ...ak X i ^ Pa 0 ...ah-iiah+i...ak X ah = ® = a k+ 1 > 

h =0 


are the Equations of a dual basis for S k . Since for the given values 
of i we may write 

i = a j (j = k+1, 

and then 


P, 


0,0 ... dfr - 1 dj dh + 1 ... dJt 


— pd>ic + !••• «j-i dhdj +1 ... On, 


the equations become 

k 

p^k+i — a nX a .+ 2 p a/c+1 "' a i-i a h a j+l — a nX ah =0 (j = k + 1 , . . . , ft), 

3 fc-0 


or, more simply, 

n 

2 p a M — a )-i ia )+l'" a nx i = 0 (j = k+1, 
i-0 

on inserting terms with zero coefficients. We write 

Uj = p a k^l ■■■ a j-li a )+v a n } 

and the equations of S k are then 


n 


E n) 

i-0 


x { = 0 


(j = k+1, 


»). 


( 4 ) 
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The matrix 

U = /4+1 • 


303 


defines S k , and 


f U k+l • tt k+l\ 
\ < • <) 


pptk+i“»in ss 


4*-tt 


wi*+« . 


4+i 


4 


Since 


ttf 1 = p a ie+ 1—°» (j = & + 1, 


pp a k+i“*On s= 


p a k+i ••• a n 0 • 

0 p a k+i ••• 0 


0 • • 

ss ( p a k+l ••• 

and therefore we may write our previous result in the form 


(p a k+i ••• k—lpifc+i •••tn = 


4*-ft 


<* + ‘ 


4+i 


wi 


If ^ = a j} except when j = t,u,v, ... 
being s elements in the set (i, u,v, ..., 

(p a k+l‘“ a n'j 8 ~^ pik+i ••• = 


, u>, we find, as before, there 
w>), that 


4 . uj w 

• • • 

4 ♦ 


(5) 


Finally, we note the following relations between the uj defined 
in (4) and the x] defined in (1) above. 

(i) If j > k, and i < k, 

uf = p°*+ 1-“» = p a# ... ai = <; 


(ii) if i,j >k,i =t=j, and Z, m ^ £ =f= m, 

K = o = *L; 

(iii) if i > fc, j < &, 


= p a k+i — °» 

“ Pa 0 .. . aj-iaiOj+i...a n 


- X ; 


Oi* 


We now use these results to obtain necessary and sufficient con- 
ditions for a given to meet a given ^ in a space of at least t 
dimensions. 
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5. Some results on intersections and joins. Let the coordinates 
of S h be •••) and let the dual coordinates of S k be 

(. . . , <^*+ 1 We prove 

Theorem I. Necessary and sufficient conditions that S h and S k 
have a linear space of dimension not less than t in common are 


2 q a k+i ••• a n-t ••• *>p kl " = 0 


A 1# ...» A* 


( 1 ) 


for all choices of a, k+1> a n _ s ; /? 0 , . . . , p h _ s , where s = h — t + 1 . 
A sufficient set of conditions is obtained from (1) if we restrict our 
choices of a k+1 , a n _ s to all possible selections from a k+v ...,a n , and 
fio> 1° possible selections from b 0 , ...,b h , where a k+1 , ...,a n 

and b 0 , ...,b h are sets of n — k and h + 1 numbers respectively chosen 
from the set 0 ,...,n such that 

f*+i™ a »p b9mbk + 0. ( 2 ) 

Clearly if 

9 ak+1 " an Pb 0 ...b h = 0 

for all sets (a n _ 8+1 , ...,a n ), (6 A _ 8+1 , ..-,6*), condition (1) is satisfied. 
We need only consider sets a k+v ...,a n \ b 0 ,...,b h , therefore, for 
which (2) is true. 

If we now write 

Uj = g a k+i"’ a j-ii a j+f" a n t 


and 


x 


1 Pb 0 ...bi- x jbi +1 ...b),’ 


the equations of 8 k can be written 

n 

'Zujxi-O (j = k+l,...,n), 

i — 0 

and the points 

x i = (x i 0 ,...,x i n ) (i — 0,...,h) 


form a basis for S h . Any point of S h can be written in the form 

h 

2 A a x a , and lies in 8 k if and only if 


2 2«^?A a = 0 (j - k+l,...,n). 

i-0 a= o 


(3) 


These equations in (A 0 , . . ., \ h ) determine the ( h + l)-tuples such that 

h 

2 lies in S k . Hence a necessary and sufficient condition that 
o 



6. SOME RESULTS ON INTERSECTIONS AND JOINS 305 


8 h and 8 k meet in an S t (at least) is that the equations (3) should 
have at least t + 1 independent solutions. The rank of the matrix 


must therefore be h — t, at most [II, § 6]. For this to be the case it 
is necessary and sufficient that every determinant oi s = h — £+1 
rows and columns extracted from this matrix should be zero. A set 
of necessary and sufficient conditions is therefore given by the 
equations . n n . 


A-0 


AA 


2 A A 


A-0 


= o, 


(4) 


2 A A 


A-0 


2 A A 

A=0 


for all possible selections of i v ...,i 8 from (&+ 1, ...,n) and of 
ji, ...,j a from (0 The above determinant, by II, §8, Th. V, 


= 2 
At, ..., A# 

A 

• A 


A, • A 

• • • 


A 

• A 


A, • A. 


Now, by equation (5) of the preceding section, 


A 


u 


Aj 


— e(q a k+l ••• q a k+ l ••• a n-s ••• 


u 


At 


u 


Is 


where oc k+v ... ,oc n _ 8 is a selection of n — Jc — s numbers of the set 
a k+i> -"> a n which is uniquely determined by the set i l9 . . . , i 8 , no two 
distinct sets i v ... y i 8 giving the same selection ot k+li ... y a n _ Si and e 
is +1 or —1 and is uniquely determined by i ly . . . , i 8 . Similarly, 
using equation (3) of the last section, 


A 


A. 




A, ■ A. 


where /? 0 , ...,/? fc _ 8 i s a selection from the set b 0 ,...,b h which is 
uniquely determined by j lt and e' = ± 1 is also uniquely 

determined by j v We can therefore write, instead of (4), 

2 ee'(< 7 a * +1 - a np bo ... J 3 - 1 q a ^ - “•-* A i - A *p Al A = °. (5) 

Ai> ...» A s 

where the factor ee', being independent of A x , . . ., A„, may be ignored. 


HP I 


20 
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Using (2) it follows that the necessary and sufficient conditions (6) 
are equivalent to equations (1) for all choices of a fc+1 , 
from a k+1 ,...,a n , and of /? 0 , ...,/? A _ a from 6 0 , ...,b h . This proves 
the theorem. 

We now obtain some formulae connected with the joins and 
intersections of linear spaces. Let S h have the coordinates 
s k coordinates (...,? i# .„ it , •••). and. as usual, 
denote the dual coordinates of S h by (...,p ift +i •••*», ...) and those of 
S k by (...,^+i-S ...). Let 

x 1 = (a:j, •••,*!) (i *0 h) 

be a basis for S h , and 

y* = (yl-'-tyi) (*-o, 

be a basis for S k . A necessary and sufficient condition that S h and 
S k meet is that there exist elements A 0 , ...,A h and /t 0 , not all 

zero, such that h k 

0 0 

A necessary and sufficient condition, therefore, is that the matrix 


4 

y o 
Wo 



(6) 


should be of rank h + k + 1 at most, since the rows are to be linearly 
dependent. Hence all determinants of h + k + 2 rows and columns 
extracted from this (h + k + 2) x (n+ 1) matrix must vanish. If we 
consider the determinant which corresponds to the columns with 
suffix *o> and expand it by Laplace’s formula [II, §8, 

Th. II], we have 

S ^il'".*ih+*+\Pjt"'ihQh+i"-1h+k+i ~ 

J9f “MjA+i+i 

It can be verified that these conditions are equivalent to those 
given by Theorem I. 

If S h and S k do not meet, the points x °, . . . , x h , y °, . . . , y k are linearly 
independent [V, § 4], and determine the join of the two spaces. 
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Hence the determinants of h + k + 2 rows and columns extracted 
from the matrix (6) are the Grassmann coordinates of the join of 
the two spaces. We may write these coordinates as (. . . , r if> ___ ift+t , . . . ), 
where 

r i 0 ...ih+k+i = ? ^l'-’’’ih+*+\Ph"Jh^ih+i-.Jh+k+i‘ W 

J 0> ••• tdh + k + l 

Ignoring a common factor, we may also write these equations as 

r io — ih+k+l = ^-Pio — ihQih+i — Jh+k+l* (8) 

i 

where the summation is over all derangements j 0 , ...,jh+k+i °f 
i 0 , ..., i A+fc+1 , the positive sign being taken with even, and the 
negative sign with odd derangements. 

Introducing the dual coordinates, we have 

7iA + i...iA+*+i = (-1 

where (j 0 , ...,j h+k+v i h+k+2 , an even derangement of 

(V • • •> in)’ Hence 

r io...ih + k + x = r^k+t-in = S ( - 1 )( ft + 1 )( fe + 1 >^ 0 ... }h qh- h <»+*+•••• <n 

i 

x= )%p j jh qU-1hih+k+%-in. 

i 

Ignoring the common factor ± 1, we may write this as 

2 Pj Q j qh^1hih+k+f”in % 
j 

An alternative form is 

f^h+k+f^n z= 2 +pih+k+%”' i n1o”*hqh+i”*1n t 

This follows from equation (8), the proof being left to the reader. 
By duality, or by an exactly similar argument, we show that if 
h + k>n, and 

i 

then fa-^n-h-k = o (all i v ■••,h n -h-k) 

is the condition that 8 h and S k meet in a space of more than h + k—n 
dimensions, but if these spaces meet in a space of dimension 
h + k—n, the coordinates of their intersection are ( . . . , fa — i tn-h-k i . . . ) . 
Equivalent formulae can also be found for t io ^ ih+t n . 


tO-M 



308 


YD. GRASSMANN COORDINATES 


We now obtain the coordinates of the projection, from a vertex 
8 h , of a space S k on to a space 8 n _ h _ v We assume that S h does not 
meet S k . The projection is then the intersection of the join of S h 
and S k with 8 n _ h _ v The coordinates of the join of S h and S k are 


given by 


S ^uY..ih+*t\Pjo- -ih < ljh 


+l — ih+k+l > 


and if the coordinates of are (..., sv- l \ the coordinates 

...) of the intersection of this join with 8 n _ h _ x are given 
by the equations 


— Ik ? ? S Pjo ■■■ "**• 


In the case h = 0 the coordinates of the projection of S k from the 

71 

point b = (b 0 , ...,b n ) on to the prime £ B t x t = 0 can be obtained 
more simply. Let 0 

x* = (ag,...,a&) (i = 0, ...,&) 

be a basis for 8 k . Since b does not lie in S k , the points b, x°, ...,x k 
are linearly independent. Hence the points y°, ...,y k , where 

y^Ax'-^B^b (A = b,B, + ... + b n B n ^Q) 

are linearly independent. But since 

S B iV ) = AXB j x i j -(x B } xh A = 0, 

i j \ j 1 

the points y°,...,y k lie in the prime £ B^ = 0. Hence these points, 

i 

which lie in the projection of S k on to the prime, may be taken as a 
basis for the projection. Therefore 

h a ...ik = yl • A 
y\ • y\ 

yl • yl 

- Axl-X\ 0 . Axl-A\ k 

Axi'—A k b io . Ax$ k -A k b it 

A* = S Bf x\. 

1 


where 
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Expanding this determinant we obtain 


t, 




= A k+1 


4 


4 


A. • 4 


-A'lLbu 

j 3 


A. . 4 -. A ° 4 +. 

• • • • • 

A, • 4-i Afc 4 


a; 




V+i 


• 4 


and substituting for A* we find that 


— A k+1 P{ 0 .,'{ k —A k '£ l '2 l b ij B l p i() '"i._ 1 ( {j+1 " ik . 
j l 


We have therefore proved 

Theorem III. The coordinates (...,t io ik , ...) of the projection of 
the k-space (■■■,Pi 0 ...i k , ...)from the point ( b 0 , ...,b n ) on to the prime 

B 0 x 0 + ... + B n x n = 0 


are given by the equations 


h 0 ...ik ~ Api 0 ...i k — E£( — i) 1 b i) B l p H "" i , ti j +lmtk , 
j l 


where 


A = B 0 b 0 + ... + B n b n . 


6. Quadratic /^-relations. We have shown that any A-space 
8 k has a set of coordinates ...) in a given allowable co- 

ordinate system, but it does not follow that a set of + J ) elements 


Pio ik w hich are skew-symmetric in their suffixes, and are not all 
zero, are necessarily the coordinates of an S k . Indeed, this is not 
the case, except when k = 0 or k = n— 1. We must therefore 
consider what algebraic relations, if any, connect the coordinates 
of an S k . 

We first show that there can be no linear relation. Suppose that 


s 

•••» Ik 


u lo...ik ?io:.ik 


is a homogeneous linear form in the * J j independent indeter- 

minates P io .„i k (which are skew-symmetric in their suffixes) such 

thftt a » i v 

S ^io...ikPio...ik = °> (!) 

U ik 


whenever P io ik is replaced by the coordinates p io _ ik of any &-space. 
We show that each coefficient must be zero. Let us show that 
u ao...ak> f° r instance, is zero. 
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Consider the fc-space defined by the k + 1 points 
We see at once that 

Pi 0 ...it = 

unless is a derangement of a 0 ,...,a k , whilst p a<) ... ak — 1. 

The relation (1) becomes 

u a 0 ... at = 

Thus we have 

Theorem I. The coordinates of the k-spaces in [»] do not satisfy 
any linear relation of the form, 

, 2 . u i 0 :..ikPio"‘tk = °* 

There are, however, relations of higher degree satisfied by the 
coordinates of every S k in [»]. We assume that &=j=0, and k^n — 1, 

and denote by F(P) a homogeneous polynomial in the + jj in- 
dependent indeterminates (which are skew-symmetric in 

their suffixes), and denote by F(p) the result of replacing P io ik 
by the coordinates Pi 0 ...i k of an S k . We seek non-zero forms F(P) 
such that F(p) = 0 for every S k in [»]. 

Let be k distinct numbers which are chosen from the 

set 0, and let j 0 , be k + 2 distinct numbers chosen 

from the same set. We define 

fc+i 
A «=0 


and prove that 

for every S k . 
Let 


■Fii,-ik,Jo — Jk+SP) ® 


A* = (a^,...,at) (i = 0,...,k) 


be a basis for any given S k . Then 


^ix...ik, io — ik+SP ) 
k + 1 

= 2(-l)* < 

A-0 




a U-i a ik+i 


/y k /y k 

a i* 


/yfc jyk 

a iA+i 
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If At 1 is the cofactor of af A in the determinant • 




< 

may write 


< 

..ik,io — ik+SP) 

k k + 1 

= S S (-i) A ^ 

/i-OA-O 

k 

- s Af 

A 

< • 

i 


a i • 



a i ♦ 

<4 


>y 0 /y 0 



• • 

» 


a <* a iA 



* 

/y° 

a iA+i 

• <+, 

• «L. 

a iA+i 

• <4 + , 


* l m 

<4 + . . 

«JL 


Since every determinant in the above sum has two rows equal, 
every term in the sum vanishes, and therefore 




■ JtJP) 


0 . 


( 3 ) 


This relation simplifies when the numbers i v ...,i k include some 


°f io> •••>ifc+i- If 


\ ~ Jr (r = 


F i\ — ik, U...lk + S p ) Pi l — ik h P il tfc+i ^ )* +1 P h — ik ik+\ P i»i\ ••• iV 

all other terms vanishing. Since the indeterminates are assumed to 
be skew -symmetric in the suffixes we have 

F ii — ik,1t — Jk+i( P ) = P ii — ikSo P ii — ikJk+i~ P ii — ikio P ii-ikik+i ~ 

In this case there is no relation. We obtain non-trivial relations 
containing three terms if 

jo> • • • >jk + 1 = *l> • • • > ^k-V h ft 
respectively. Writing i k = h, we find that 


F h...ik,h.. 


• ik+i ( P ) ~ ( 1 P ii...ik-ihl P ii...ik~itnn 


+ ( 1)* ,„i k _ t hm P ii ...i k -iln "b ( P ii...ik-ihn P i 1 ...ik-ilm’ 


and therefore 


Pit ... ik-\ hlPii ... ik-imn^Pii ... ik~\hmPii ... 

~^~Pii...ik-ihnPii...ik-ilm = 
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We now obtain an alternative form for the relations (3). If we 

write . . ... 

• • • > V-i> •••>*'&> l 8>!) o> • • • >Jk 

in place of ... 9 i 89 

for any 0 < s ^ the relation (3) becomes 

2V. 

k+l 

23 (“"1 ^Pio — ia-iis+i — ikh-iPish — h-ih — ik ^ 

A =* 1 

or 

+ S ( - 1 ) A+fc '" 8+1 l>io...<*-xiA^+l 1 ) A 2 , *o...#A-l<*#A+l...fc = °» 

A = 0 

using the skew-symmetric property of the coordinates with respect 
to the interchange of suffixes. Hence (3) may always be written 
in the form 

k 

Vu...ikPu...h ~ ^Pi 0 ...i 3 -ihi»+i...ikPh...h-iish+i...ik' ( 4 ) 

A =» 0 


The relations (3), or (4), will be referred to as the quadratic p-relations. 
We now prove 


Theorem II. 


If (•••» Pi 0 ....h>~-) be 



dements of K which 


are not all zero, which are skew-symmetric in the suffixes and which 
satisfy the quadratic p-relations, then there is a k-space which has 
coordinates {...,p io ... ik , ...). 

If there is such a &-space it is, of course, unique. Since not all 
the coordinates are zero we may suppose that 


Pat-ai^Q- 

It will simplify our notation if we assume that 

a { = i (i — 0, ..., k). 

It will be evident that the method of proof can be extended to any 
set a 0 , ...,a k . We have, then, 
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and since we are dealing with homogeneous coordinates we may 
assume that — L If 

b) = (* = •••>&> j = 0, 


we shall prove that the A: + 1 points 


B i = ,6£) (i = 0, 


k) 


determine a fc-space with the prescribed coordinates. 

Since b\ = p 0 , k — l (i — 0, ...,&), the submatrix formed by the 
first k + 1 columns of the matrix (b*) is the unit matrix. Hence this 
matrix has rank k + 1 , and the points B°, ...,B k determine a &-space, 
which we call S k . If 


... ijt 


bi. 


bl 




b’f 


* 


the Grassmann coordinates of S k are •••)> and all that 

we must prove is that 

?*„•••** = Pio -ik ( a ^ ®o> •••>**)• 

Since the Pi a .,.i k are skew-symmetric in the suffixes, this result is 
independent of the order of the suffixes. Let the set i 0 , ...,i k contain 
the integers l v l 2 , ..., l 8 which are greater than k, and write the set 

— o* 1 , 1 %, j %+ 1, i,a 8 ,^ 8 + i> •••»&• 

Since b) = Sj ( j ^ k), 


the determinant which gives q f can be simplified by expanding 

it in terms of the elements of any column whose suffix is in the 
natural order of the integers. This column contains only one non- 
zero term, and this is on the principal diagonal, and is unity. Hence, 
noting which row and column are removed by this simplification 
from the determinant we are considering, we eventually find that 


?<<>•••** — 


b\\ 

k 


b\\ 


bk 


( 6 ) 


Our theorem is now proved by induction on s. 
If s = 0, . • n 7 

1/Q> • • • y 1 k — U,...,/C, • 

= 1 = Pi 0 ...i k > 


and 
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^ 8 ~ % ...tk = b\\ = Po...j 1 -ll 1 j 1 +l...k> 

so that the theorem is also true when 8 = 1. We therefore assume 
that if 8 < t, Pt t "' ik — - that is, 

Pi»...ik= b k • 

• • • . 

H* • M: 

Now suppose that 8 = t. Since the p io ,„ ik satisfy the quadratic 
p-relations we have, from (4), 

* 

Pio— itPo...k 2 

j -0 

where r is at our disposal (0 < r < k). Choose r so that i r = l t . Then 

Pi 0 ...ir-iH,+i...ik - °> 

unless^' = j v . . . ,j ( , these being the only missing integers of the set 
0, If now j = j x , say, p io ... ir _ lhir+1 ... ik has only t-l suffixes 
not chosen from (0, and hence, by the hypothesis of induction 


Pio.:ir-ij\ir+i — ik 

t>i ■ K-, 



■ K 


K ■ <■*-. 

bi 

»L. 

■ b l i 


b\\ 

• b\ U 


M?-' 

• bfc 

H 

1 

• 

bit 

♦ b\U 

b\V' 

b% +1 

• bfc; 

b t l 

k 

• bi 

bl 

(-1)^ 

b\\ • 

b L 


bft-' . 

bt; 



b tv 



b L 


on expanding by the last column, the only non-zero element in 
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which is = 1. Now p ofc = 1, and, since j = j A and i r = Z <( we 
have Po...j-ii T j+i...k — ML and so the quadratic ^-relation gives us 



Mi 

• ML 

A-l 


• • s 


Mh 

• ml -, 1 


bfr' 

• MLV 


Mi 

• ML 


Ml . ML bt ; 


bt 


Ml. 


Mi 


thus proving the theorem for s = i. This completes our proof of 
Theorem II. 

We have thus obtained certain necessary and sufficient conditions 
to be satisfied in order that a set of elements •••) °f -K 

may be the coordinates of a fc-space. In the course of doing this we 
have found certain conditions — the p-relations — satisfied by the 
coordinates of a &-space. This raises the question of other relations 
satisfied by the coordinates of a &-space, and the complete answer 
to this question is the subject of the next section. 


7. The basis theorem. This section is devoted to the following 
theorem: 


Theorem I. If F(P) is any homogeneous polynomial in Pio...U 
such that 

F(p) = 0 

for all S k , then 


F(P) = S A 




*1 — 


( P)F* 


ii — ikJo — h+i 


(P), 


(i) 


where F {l ik}o }k+l (P) is the quadratic form defined in §6, (2), and 
^ a homogeneous polynomial in P io _ ik . 

For the purpose of this proof we define a generic A-space in [n 
to be the fc-space defined by k + 1 points 

(M> •••>&») (i = 0,...,k), 
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where the bj are (n+ 1) (k + 1) independent indeterminates over the 
ground field K. Clearly, if F(P) is any form such that 


F 


K 

K 


• VL 

• • 

hH 


) 


= 0 , 


this relation remains true when the bj are specialised in any way; 
in particular, when the points (b \, . . . , 6* ) are specialised so that they 
become k + 1 independent points of any assigned fc-space. Hence 


for every S k . Conversely, if 
for every S k , 



F(p) - 0 

F(p) = 0 

K ■ b l 
• • • 

«. • 


) 


vanishes for all specialisations of the bj. Hence, regarding this last 
expression as a polynomial in the (n+l)(fc+l) indeterminates 
bj, since it vanishes for all specialisations of these indeterminates, 
it vanishes identically [III, § 8, Th. I]. Thus in order that F(P) 
should satisfy the conditions of the theorem, it is necessary and 
sufficient that F(p) = 0, where (...,p io ii , ...) are the coordinates 
of a generic k- space. 

Now let F(P) be a form satisfying the conditions of the theorem. 
We group together all the power products in F(P) which are isobaric 
in each of the suffixes 0, that is, we consider all the power 
products in which the suffix 0 appears A 0 times, the suffix 1 appears 
A x times, and so on. If the sum of such a set of terms is denoted 

w = s/a .... un 

A 



Now let bj (i = 0, ...,k;j = 0, ...,») be (n.+ l)(&+l) indeter- 
minates, p 0 , ...,p n n + 1 additional indeterminates, and let 


?/„...<* = Pi 0 bl p ik b° it 
PiA, . p ik b k h 

“ Pi*Pii * * * Pik Pio ••• ik' 
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Then •••) and (•••, 7< 0 ...<*> •••) are coordinates of generic 

^-spaces. Hence 

0 = F(q) = S/a 0 ...a m (7) = Xpi- - Ptf x n (P)- 

A 

Since p 0 , . . . , p n are independent indeterminates over K(b*), it follows 

thftt /a 0 ...a„(p) = 0 

for each set A 0 , ..., A n . Hence 

Theorem II. Any form F(P) which satisfies the conditions of 

Theorem I is the sum of forms isobaric in each of the suffixes 0, ...,n 

which satisfy the conditions of Theorem I. 

In proving our first theorem we may therefore confine ourselves 

to forms F(P) which are isobaric in each of the suffixes. We now 

proceed by double induction. First, we assume that Jc is fixed, and 

let n vary. If n = k+ 1, an 8 k is a prime, given by a single linear 

equation , , _ 

O 0 *0+-+«fc+l»fc+l = °> 

and its coordinates are 


= (-1 )*-*«!>< = 

Since the a i can take arbitrary values independently, there is no 
non-zero form F(P) such that 

F(p) = 0 

for every S k in [&+ 1]. 

We may therefore assume, as hjrpothesis of induction, that 
Theorem I is true for the coordinates of ^-spaces in [n— 1]. Equi- 
valently, we may assume that the theorem is true for forms F(P) 
in which none of the Pi 0 .„i k actually present has n as a suffix. 

We now suppose that F{P), which is isobaric in each suffix, is 
isobaric of weight r in the suffix n, and is of degree t in P io ii . By 
the hypothesis of induction made above, Theorem I is true if r = 0. 
We therefore make the further hypothesis of induction that the 
theorem is true for forms F(P) which are isobaric of weight less 
than r in the suffix n. 

We write dF(p) 


to denote the result of replacing by the coordinates 

of an S k in 


dR 


<«...<* 
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Let bj (i = 0, j = 0, ...,n) be independent indeterminates, 
and let 

Pi 0 ...ik = 


b°i. 


b°- 

u ik 


b H hH 

U lo • U lk 

Then if O(P) is any form in P io ,.. iv (O(p) does not necessarily 
vanish), we write 

g(b) = O(p), 

and we find that 

Mb) ^ dO(p) 


where 


06 ) 


Hence 


Since 




(_l)fe+i 

6 ?. 



Mr 1 

• fifci 


K +1 



< 

* 


_ y „ 80(p) 

m - Ftp) = o, 


< = 0 


( 2 ) 


and the 6) are independent indeterminates, / (6) is identically zero, 
and therefore k 


2 bl 


= 0 . 


i =0 


By (2) we can write this in the form 

„ 3F(P) 

Hence the n forms 

(*-» »->) 

l o — lk-in 

satisfy the conditions of our theorem. Since each of them is isobaric 
in the suffix n of weight r— 1, we may write, by the hypothesis 
of induction, 

2 Pi 0 . ../*_! h ( 3 ) 

I ujr l 0 ...li-in 

where h = 0, 1. 
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We propose to deduce (1) from (3) by operating on each side of 
(3) with • g 

Pfl ~ » (^) 

m usr mQ ... mjc-\h 

and then summing with respect to h from h = 0 to h = n— 1. We 
first consider the effect of this operation on the left-hand side of (3). 


n— 1 

S 2 p m . 

h=0m Gjr m 


i P mp) \ 

li“- w aW 


_»-> af(.P) 

M. ‘-‘■“iU,. 


+ X 2 2 -^^...m*_in^Z 0 ...Z*_ 1 /i. 
A=0 m Z 


= utF(P) 


+ 2 2 2 ^n 0 ... ro*-i n ^Z 0 ... Z*_ r A 

fc = 0 m Z 


2 2 ^mo 

m l 

= [»r — t(t— 1)] P(P) 

"1* 2 2 
A=0 m Z 


homogeneous polynomials. 

Now let a 0 , ....a*, be a set of numbers chosen from (0, and 

consider the terms of the summation in (5) corresponding to 

m 0 , •••> Wt/c-l = «0> •"> 1> ®i+l> • • • > 

h = a f . 

Then the corresponding terms in the summation in (5) are 


■‘aJi. 



0 2 P(P) 

a^V.. 

f)P 

> wtjfe-i ft Z 0 ... /jfc-i n 


d 2 F(P) 


. m*_ x A „ij. in 


0 2 P(P) 

in 0P m „0P, ,p „ 

771q ... mjc—i 7i Iq ... ljc—\n 


0 2 P(P) 

'dP m 

m 0 .. 


and 

Euler’s theorem 


_0 2 P(P) 
r dP^dP^ 


P) r k 

L v P P 

> /a* Z 0 ...Zyfc_iOt x a 0 ...a{- 1 nai +1 ...ak 

l 0 ... lk-i n Lt=0 J 


mp) P 

- afV..«w 

a»W F , P , 


by §6, (2). 
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Hence, 

%' D yp WP) 


d*F(P) 


a l G - r a 0 ...ajb Gir lo"'lk-i n 

mg) P (P> 

^ ^ 7)P 7)P *lo — lk-i>na 0 ...aj e \ jr ) 

a l U£ a 0 ,,,ajc ujr l 0 ...lk-i n 

- Tl.-T+itWtSP^.js-? — (y.P^^ 1 ) 

l Gjr l 0 ...lk~in\ a vir a^,..ajc/ 


_VP ap ( p ) 


-ss. 


d 2 F(P) 


^ ^ ?)P 2P “ *o — lk-i*na 0 ... ajc 

a l G *a 0 ..,aje G *lo.--lk-i n 

= r(n — t + 1) F(P)+rtF(P) 

-tF(P) 

d 2 F(P) 


( p ) 


-ss 


o J ^Pa a ...ait^Plf'lk- i« 


Plo—lk-i,na 0 ...ak(P) 


= t(» + < — r) F(P) 


-S2 

a J 


3P„ 


a 0 . 


WF{P) 

..ak dPl„. 


F, 


.ik-in 


lo ••• w a<> ... ajc 


(n 


( 6 ) 


Since £>t, and we are assuming that r>0, the coefficient of F(P) 
in (6) is not zero. 

We now consider the effect of operating on the right-hand side 
of (3). We have 

DhL-Ai 1 ...ik,j 0 ...jk+i(P)Ph — ik.io — 1 k+i(P)] 

= Dh[ A i 1 ..AkJo..Jk+i(P)]Pii...ik.1o-.1k+i(P) 

+ A ii...ik,Jo-Jk+i(P) D h[P% x ... i k ,h - iJM 

From this equation, and equations (3) and (6), we derive (1) if we 
can finally prove that 

Dh[Pi 1 ...ik,j o...i*+i( P )] 
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is expressible as a linear combination of forms F ai%aicbi)bk+x (P) 
for all choices of i x , ... ,i k ,j 0 , ... ,j k+1 and h. 

(i) If h is different from i lt ...,i k and from j Q , ...,j k+v clearly 

(ii) If h = i v say, but h #= j 0 , ■••,jk+v if is easily seen that 

D h[ F i 1 ..A k .f 0 .ik+S P )] *«(*>• 

(iii) lfh= j 0 , say, but h * i lt . . . , i k , 

... { kt j 0 ...jk+S P )] = P ii...ik,nj l ...]k+S P )' 

(iv) Finally, if h = = j 0 , say, 

DfA-Fit ... f k ,j 0 ...j k+ 1 {P)] — F n la ik j o ^ i+I (-P) + F ix ik n) 1 ...ji + 1 (F). 
This completes the proof of Theorem I. 



CHAPTER VIII 


COLLINEATIONS 

In this chapter the ground field K is again assumed to be commuta- 
tive and without characteristic. It will not be necessary at first to 
assume that it is algebraically closed but, later, results will be 
obtained on the explicit assumption that K is algebraically closed. 


1. Projective transformations. Let 8 m , S n be two distinct 
projective spaces over the field K, of dimensions m, n respectively. 
Fix an allowable coordinate system x in 8 n and y in S m . Then 
consider the set of equations 

n 

Vi = (i = 0, a i} in K), 

i =o 

which we usually write in matrix form as 

y = Ax. ( 1 ) 

We may assume that the (m+l)x(n+l) matrix A is not the zero 
matrix. The equations (1) associate a point (y 0 , y m ) of S m with 
any point ( x 0 , . . . , x n ) of S n which does not satisfy the equations 


n 


2 aiM 
i “0 


= 0 (i = 


0 , 


m). 


These points, which we call the exceptional points of 8 n , fill a linear 
space of dimension n — r, where r is the rank of A. 

If r = n+ 1 there are no exceptional points, and (1) is said to 
define a non-singular projective transformation of S n on S m . In order 
that such a transformation should be possible, we must have 


If r<», the equations (1) are said to define a singular projective 
transformation of S n on S m . 

Now lot „ _ Px . (2) 


define an allowable transformation of coordinates in S n , and let 

y = Qy* (3) 

define an allowable transformation of coordinates in 8 m . In 
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the new coordinate systems the projective transformation (1) 

becomes „ , „ 

Qy* = APx*, 

that is y* = Q -1 APx*. 

By II, § 4, Th. II, we can find P and Q so that 

Q-'AP=(I r 0\ 


(t :)• 


where r is the rank of A. Calling r the rank of the transformation, 
we have proved 

Theorem I. By a suitable choice of coordinate systems in S n and 
in 8 m any projective transformation of rank r from S n to S m can be 
taken in the form 

Vi = (i = 0, ...,f — 1), Vi = 0 (i^r). 

We call this form of the equations (1) the canonical form, and in 
discussing the geometry of a projective transformation it is usually 
convenient to use the canonical form of the equations of the 
transformation. Before doing this, we consider the points of S m 
which lie in the prime m 

1 ,”^ = o. 

0 

In the projective transformation (1) these are just the points which 
arise from the points of S n lying in 

n 

J^UjXj = 0 , 


where 


u,= y-o, 

i **0 


We write these last equations in matrix form as 

u = A'v, (4) 

where A' is the transpose of the matrix A. These equations define 
a projective transformation of the primes of S m on the primes of S n . 
We call this transformation the dual of the projective transforma- 
tion (1). The transformations of coordinates (2) and (3) induce 
the dual transformations 


u* = P'u, v* = Q'v, 


21*2 
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respectively. Therefore the dual transformation (4) becomes, in 
the new coordinate systems, 

u* = P'u = P'A'v = P'A'(Q')- 1 v* 

= (Q-'APYv*. 

The relation between a projective transformation and its dual is 
therefore preserved under transformations of coordinates. This 
result is otherwise evident from the geometrical definition. 

If the point transformation is in canonical form, the dual trans- 
formation is in the canonical form 

u t = v t (i = 0, ...,r— 1), u t = 0 (i>r). 

Before discussing the geometry of projective transformations 
for arbitrary m, n, r, we consider the simple case in which 

m = n = r — 1. 

The canonical forms are then 

Vi = (i = 0, ...,»), = v { ( i = 0, . 

To each point of S n (prime of S m ) there corresponds a point of S m 
(prime of S n ), and conversely. The inverse correspondence is also 
a projective transformation of 8 m on S n . Linearly independent 
points or primes correspond to linearly independent points or 
primes, and, similarly, linearly dependent points or primes corre- 
spond to linearly dependent points or primes. Hence, any S k of 
S n corresponds to a unique fc-space S' k of S m , and any S' k of S m 
arises from a unique S k of S n . If S k is determined by the points 

of S n , and has Grassmann coordinates ...), S k is deter- 

mined by the points 

(&$,. ~X) (i = 0,...,k) 

in S m . Hence its Grassmann coordinates are (•••>?<„...<;., ...), where 
Vio...i k = b i • b l =Pi 0 ...if 

b l • b t 

It follows that the projective transformation carries the S k with 
coordinates •••), into the S' k of S m with coordinates 
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•••)> ^ or an y value of k. In particular, if k — m—1, 


the 8, 


n — 1 


(u 0 ,...,u n ) 


of S n is transformed into the 

(u 0 ,...,u n ) 

of S m , and this transformation of the primes of S n into the primes 
of 8 m is the inverse of the dual projective transformation of the 
primes defined above. 

We now return to the study of the projective transformation of 
rank r of S n on S m , where m, n are arbitrary, and r > 0. We choose 
coordinate systems in both spaces so that the projective trans- 
formation is given in canonical form by the equations 


(i = 0, ...,r— 1),1 
(i>r). } 

The dual transformation is then given by the equations 

(i = 0, 1), 

(i > r). 

The exceptional points of S n , that is, those which have no trans- 
forms, are the points of the S n _,. 

( i = 0 , 


Vi = 
Vi = o 


= Vi 
Ui = 0 


( 6 ) 


( 6 ) 


x. 


0 


We call this the singular space of S n . The locus of points in S m 
which are transforms of points of S n is given by the equations 

Vi = 0 (i>r), 

and is therefore a linear space of r — 1 dimensions, say. This we 
call the vertex of the transformation. 

Now consider the dual transformation (6). It is also of rank r, 
and the primes of S m which have no transforms are given by 

v { = 0 (i = 0,...,r— 1). 

These primes intersect in the vertex Z r _ v Similarly, the primes oi 
S n which are transforms of primes in S m are the primes which pass 
through the singular space 8 n _ r . 

The equations (5) define a non-singular projective transformation 
T* of the space S r _ x of 8 n , given by the equations 

Xi = 0 (i > r), 



326 


Yin. COLLINB ATIONS 


on 2*r- V and the equations (6), for i = 0, 1 , define the dual 

of T*. The properties of this non-singular transformation from 
£ r _i to Z-x follow at once from our previous discussion of the non- 
singular case of projective transformations. 

We now consider how the properties of the transformation (5) 
follow from a knowledge of the properties of T*. Two points 
(x 0 , . . . , x n ) and (x' 0 , ...,x' n ) of S n> which are not in the singular locus 
8 n _ r , are transformed by (5) into the same point of S m if and only if 

x t — Xx\ (i = 0, 1). 

Geometrically this condition indicates that the points have the 
same projection from the singular locus S n _ r on S r _ v this projection 
being (cc 0 , 0, Now, each point P of S n which is not 

in S n _ r is joined to S n _ r by a space 8 n _ r+1 . It follows that a point Q 
of S n which is not in has the same transform as P in S m if and 
only if Q lies in the S n _ r+1 defined by P. Thus, to any S n _ r+1 in S n 
wh^ch passes through the singular space S n _ r there corresponds a 
unique point in i7 r _ 1( and conversely. We may call this corre- 
spondence between the S n _ r+1 through S n _ r and the points of Z r _ x a 
projective correspondence. It is easily seen that in this correspond- 
ence the (» — r+ 1) -spaces of S n which pass through the singular 
locus 8 n _ r and lie in an (n-r + k- f 1) -space through the same S n _ r 
correspond to the points of a i-space in E r _ v In particular, primes 
of S n through the singular locus correspond to (r — 2)-spaces in E r _ v 

The properties of the dual transformation (6) from S m to S n are 
obtained in a similar manner. If 77 is any prime in S m which does 
not contain the vertex E r _ ly and if 77' is the corresponding prime 
in 8 n , which passes through the singular locus 8 n _ r , a necessary and 
sufficient condition that another prime 77 x of 8 m should have the 
same transform IT as 77, is that 77 and 77 x meet the vertex S r _ x in the 
same (r — 2) -space. To each S r _ 2 in E r _ x there corresponds a unique 
prime in 8 n through the singular space S n _ r . This correspondence 
between the (r — 2)-spaces in 27 r _ 1( and the primes of S n through the 
singular space iS n _, is precisely that obtained above by the direct 
transformation. 

Now let us consider what happens when S n and S m , instead of 
being distinct spaces, are the same space, which we denote by [»]. 
Then the transformation (1) takes a point P of [«] into another 
point P* of [»], and, more generally, an 8 k of [ri\ which does not meet 
the singular S n _ r is transformed into another S k of [»]. 
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It is important not to confuse this type of transformation in [n] 
with a transformation of coordinates. A transformation of co- 
ordinates is a relation between two sets of coordinates for the same 
point. The present correspondence is a relation between the sets 
of coordinates of two different points. 

A transformation of type y = Ax which transforms [to] into itself 
is called a collineation in [to]. Projective geometry is the study of 
the properties of configurations which are invariant under collinea- 
tions. In the study of collineations we can deduce, by what has been 
said above, properties of the transform of* any configuration, but 
we have a new problem to consider, that of the relation of a con- 
figuration to its transform. The remainder of this chapter is devoted 
to a study of collineations. 


2. Collineations. In the space [to] over K we choose an allow- 
able coordinate system and consider the collineation given by 
the equations n 

y i =Y i °'i i x i (i — 0, ...,to), 
j =o 

or, in matrix notation, 

V = Ax. (1) 

Let a transformation of coordinates in [to] be given by 

x — Px*. 


The coordinates of the points x, y are transformed into x*,y*, where 

x = Px*, y = Py*, 

and therefore in the new coordinate system the collineation has 
the equations _ p-iJ^Px*. 

The collineation dual to (1) is (cf. § 1) 

u = A'v, (2) 


u = (u 0 , ..., w n ) being the prime which contains every point x whose 
transform y by (1) lies in v = (v 0 , ...,v n ). If A is non-singular, (2) 


can be written as 


v = Au, 


(3) 


where A — (A') -1 is the matrix complementary to A. The collinea- 
tion (3) transforms a prime u into the prime v which contains the 
transforms y of all the points x of u, and no other points. 



328 


Vin. OOLLINE ATIONS 


The transformation of coordinates 

x = Px* 

ehanges (2) into u* = (P-'AP)' v*, 

and (3) into v* = (P _1 .4P)w*. 

Having seen the effect of a transformation of coordinates on 
equations (1), (2) and (3) we now define similar collineations of [»]. 
Two collineations 

y = Ax, and y — Bx 


are similar if there exists a non-singular matrix P with elements 


in K such that 


B = P-'AP. 


This relation is clearly reflexive and symmetric, and if 
C = Q~ l BQ ( Q non-singular), 
then C = (PQ)- 1 A(PQ), 

and hence the relation is also transitive. It is therefore an equi- 
valence relation. The relation between similar collineations is simply 
that there exists a non-singular projective transformation 

x — Px* 


of [w] into itself which transforms one collineation into the other. 

Our main problems here are 

(i) to determine necessary and sufficient conditions that two 
collineations be similar, and 

(ii) to find a set of canonical forms for collineations in [ri\, that 
is, a set of collineations with the properties: (a) no two collineations 
of the set are equivalent; ( b ) every collineation is equivalent to one 
member of the set. 

The canonical forms are chosen so that it is a simple matter to 
read off from them the geometrical properties of the collineations. 
We now prove 


Theorem I. A necessary and sufficient condition that the collinea- 
tions . D 

y = Ax, y = Bx 


should be similar is that the X-matrices A — XI n+1 and B—XI n+1 
should have the same invariant factors (or, in the case in which K is 
algebraically closed, the same elementary divisors ). 
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By II, § 9, Th. Ill, a necessary and sufficient condition that the 
A-matrices A — AJ n+1 and B — \I n+1 should be equivalent, in the 
sense of that section, is that they have the same invariant factors. 
When this is the case, 


B-XI n+1 = M(A-\I n+1 )N. 

But by II, § 9, Th. V, this equation implies the existence of non- 
singular matrices P, Q over K such that 


It follows that 

and 

Hence 

and therefore 


B — A/ n+1 — P{A — A I n+ i) Q» 
B = PAQ, 
l n +i — PQ • 

P = Q~\ 

B = Q-'AQ. 


Now let K be algebraically closed. We note first of all that 
det (. A - A7 n+1 ) and det {B - A I n+1 ) 
are each polynomials of degree n+ 1 containing the term 

(_ l)n+l a»i+i 

These polynomials are therefore not zero polynomials, and 

rank {A — AI n+1 ) = rank (B — XI n+1 ) = n+ 1. 

Hence the form of Theorem I which is applicable when K is alge- 
braically closed follows from II, § 9, Th. IV. 

Now let J E r 1 (A), ...,E n+1 ( A) be the invariant factors of A — M n+1 , 
and let E t ( A) be of degree r i . Since 


it follows that 


n+1 

(_l) n +i n E t (\) = det {A — AJ n+1 ), 

i — 1 

71+ 1 

2 r t = n + 1, 
i 


and since E t ( A) is a factor of E i+1 (A), 

ri<U+ i 

We suppose that 


7*2 ... — 7* g 0, 


and that 


r g +i > 0 . 
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Let 2?<(A) — A r < + a n A r » -1 + . . . + a iT{ (i > s), 

and consider the r t x r i matrix 



We prove that the invariant factors of B i — XI H are 

1* 1» •••> l>^i(A). 

For consider the determinants of the submatrices of t rows and 
columns extracted from B { — A I r{ . If 1 < t < we consider the 
particular submatrix whose elements lie in the 1st, 2nd, ..., £th 
rows of B { — A/ rj , and in the 2nd, 3rd, ..., (<+ l)th columns of this 
matrix. It is clear that the determinant of this particular txt 
submatrix is equal to unity. Hence the highest common factor of 
determinants oi txt submatrices extracted from B t — A I ri is of 
degree zero, if 1 < t < r t . But if t = r it 


det(B i -A/ r ,) = (-ir^ i (A). 


This proves the result. 

Now let B be the (n+ 1) x (n+ 1) matrix 

B = / B s+1 0 . 0 \ 

Jo B s+2 . 0 . (4) 

\ 0 0 . B n + v 

We prove that the invariant factors of B — A/ n+1 are 
E^X), ...,E S+1 (X) E n+ i(A). 

We can perform elementary transformations [II, §9] on the rows 
and columns of any submatrix B t — XI Tt (s+l^t^n+1) of B— XI n+1 
without altering the rows and columns of any other submatrix, 
since these submatrices have no rows or columns in common. Hence 
each submatrix can be reduced to its canonical form by elementary 
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transformations on B—AI n+1 . By another elementary trans- 
formation this matrix can be reduced to the matrix 

( I 8 0.0 

0 E s+1 ( A) . 0 

• • • • 

0 . . E n+1 (\) 

Since E { (A.) is a factor of E i+1 (X), the highest common factor pro- 
perty of invariant factors makes it evident that the invariant 
factors of this last matrix are 

1, 1, 1, E a+1 ( A), E n+1 (A), 

that is, E^ A), E s {\), E 8+1 (h), ..., E n+1 { A). 

Thus B — AI n+1 is equivalent to the matrix A — A I n+1 , since it has 
the same invariant factors [II, §9, Th. III]. 

We shall take the canonical form for collineations similar to (1) as 

V = Bx, 

where B is given by (4). Hence we have 

Theorem II. If the matrix A — A I n+1 has the invariant factors 

and if the matrix B is defined as in (4) above, the collineation 

y = Ax (1) 

is similar to the collineation y = Bx, and this latter form of the equa- 
tions may be taken as the canonical form for collineations similar to ( 1). 

When the ground field K is algebraically closed, and we use 
elementary divisors in place of invariant factors, it is usually more 
convenient to use another set of canonical forms for collineations. 
Let A — A J n+1 have the elementary divisors 

(A— a x )V..>(A — a*.) 4 *, 

where, for the moment, it is not necessary to distinguish the a t 
which are equal from those which are unequal. By the properties 
of elementary divisors obtained in II, § 9, 

e*>0 (i = 1, 

+ = n + 1 . 



and 
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Now let C e (a) be the e x e matrix 

a 1 0 . 0 

0 a 1 . 0 

• 4 • • 

0 . . a 1 

0 . . 0 a 

Then the matrix C e (a) — A I e has the one elementary divisor (A — a)*. 

For det (C e (a) - A/ e ) = (a - A) e , 

whilst the determinant of the (e — 1 ) x (e — 1 ) submatrix obtained by 
omitting the last row and the first column of the matrix under 
consideration is equal to unity. 

Now let C be the (n + 1) x (n + 1) matrix 

G= /C^K) o . o \ 

/ 0 C ei (oc 2 ) ° I . (6) 

\ 0 . C ek (a k )l 

By methods exactly similar to those used in the proof of Theorem II 
we can show that C — A/ n+1 has the elementary divisors 

(A ocj)*!, (A 

This gives us 

Theorem III. If A — A I n+1 has the elementary divisors 
(A-oq)*!, ..., (A — a fc ) e *, 
the collineation (1) is similar to the collineation 

y = Cx, 

where the matrix G is constructed as in (5) with the given elementary 
divisors. This collineation serves as a canonical form for collineations 
similar to (1). 

It should be noted, however, that geometrically the two collinea- 


tions 

II 

and 

py = Ax (p + 0), 
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if given in the same allowable coordinate system, perform the same 
transformation on the points, lines, ..., of [»]. From this it follows 
that two collineations 

y = Ax, and y = Bx 

are similar in a geometrical sense if the elementary divisors of 
A — A I n+1 and of B — A I n+1 are 

(A — a x (A — a*) e *, 

and (A-AK 1 . •••.(A-ytffeK* 

respectively, where, for a proper arrangement of these divisors, 

e i=U Pi = (*“1 &)> 

y being a non-zero element of K. 

In the remainder of this chapter we shall assume that the ground 
field K is algebraically closed. 


3. United points and primes of a non-singular collineation. 

If we are given a collineation in [n] over an algebraically closed field 
K we may, as in § 2, choose our allowable coordinate system so that 
the equations of the collineation are 


where 


y = Cx, 


C - /C e ,(«,) 0 

0 C ei (a 2 ) 


0 


0 



and 


<7 e (a) — /a 1 0 
I 0 a 1 

V 

\o 0 0 



The elementary divisors of the matrix G — A/ n+1 are 


( 1 ) 


(A— a])**, (A — a fc ) e *. 



334 


vm. COLLIN RATIONS 


It is usual to oall them the elementary divisors of the collineatioi 
(1). We note that if the original collineation is 

V = Ax, 

there exists a non-singular matrix P such that 

C = P-'AP. 

Hence C - A/ w+1 = P~\A - A/ w+1 ) P, 

and therefore 

det (C — A/ n+1 ) = det ( A - A/ n+1 ) 

= (-l) n+1 ri(A-a,)«i. 

1 

Hence, putting A = 0, 

det C = det A = af' ... aft. 

We shall assume in this section that the collineation is non-singular , 

that is, that , , . , _ 

* det .4 + 0. 

It follows that each latent root a i is different from zero. 

In the canonical form (1) the submatrices C^af) may be arranged 
in any order along the principal diagonal of C, but it will be con- 
venient to order them so that 

(i) those submatrices C^a^ which correspond to equal latent 
roots a { come together, and 

(ii) if a t — and e { < e } then G^aftj precedes C^(a^). 

We shall assume that the submatrices are arranged in this way 
in all future applications. 

A point (z 0 , . . ., z n ) of [ri\ is called a united point of the collineation 

(1) if it is transformed into itself by the collineation. The condition 

for this is ri 

Cz = pz, 

where p is a non-zero element of K. The equations 

(C-pI n+t )z = 0 

have a solution (z 0 , . . . , z n ) if and only if [II, § 8] 

det (C—pI n+1 ) = 0. 

Hence p must be a latent root of the collineation. Let us take p = a v 
and suppose that 

a t = a 2 = ... = a t , 

oc { =¥a, ( i>t ). 
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The equations to determine the united points arising from p = oq are 


ajz 0 = a 1 z 0 +z 1 , 

a i z i = <*1*1 +*2. . (2-1) 

a i z « x -i == a i z «!-i> . 


^l z «i ^a z ej "h z ej+l> 


a l z «i+«*-l — lX 2 z e 1 +e 1 -l> 



®l z n 'X'k Z n' 

From equations (2-1) we have, successively, 

z i = 0, z a = 0, ..., z £i _i = 0; 
from (2-2) we have, successively, since a x = a 2 , 

z e!+l = •••> z ei+e 2 -l = 

from (2- 1) we find that 

z e 1 +...+ej_ 1 +l = •••» 

if, finally, we consider the last equation and work backwards we 
find that _ n 

z ej+...+ej+ 1 'b ..., Z n — U. 

Hence a united point (z 0 ,...,z n ) arising from the latent root a x 
necessarily has all its coordinates zero except 

z 0> z ex> z «i+et> •••> z «j+...+«_ l * 

Conversely, we can verify at once that if (z 0 , ...,z n ) has all its co- 
ordinates zero except those named above, it satisfies the equations 
(2), and is therefore a united point. 
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If A 0 , ...,A n are the vertices of the simplex of reference of our 
coordinate system we deduce from these results that the united 
points corresponding to the latent root <x x of C form the linear 
spaoe of t — 1 dimensions spanned by the points A 0 ,A ei ,A ei+gj , ..., 
J ^e 1 +...+et-i- We call this space the fundamental space of the collinea- 
tion corresponding to the latent root a v 

The united points corresponding to any other latent root of C 
are found in a similar way. The following rule is immediately 
obtained: 

Theorem I. Let a i be any latent root of C, and let 

C'e q ( a q)> Q q+ f&q+l)} •••> ('e q n( a a+s) 

be the submatrices of C which correspond to latent roots equal to <x t . 
Then the united points corresponding to a t form the fundamental 
space spanned by the points 

A A A 

■^ei+.-.+eg-iy ■ a e 1 +...+e q > •••> -^ex+.-.+ej-n-x* 

and therefore lie in a space of dimension s. 

Since each fundamental space is determined by a set of vertices 
of the simplex of reference, and since also no two sets corresponding 
to distinct latent roots have any vertices in common we deduce 

Theorem II. The fundamental spaces defined by distinct latent 
roots have no points in common. Indeed, they are linearly independent ; 
that is, if their dimensions are m 1 ,...,m r their join is a space of 

r 

r — 1 + 2 dimensions. 

l 

Again, we see from Theorem I that the join of all the funda- 
mental spaces is the space spanned by the vertices 

AAA A 

■^-O* -"-ex* ■ a - ei+e% > •••> •••> -"ex+«»+...+«*_x> 

and is therefore a space of k — 1 dimensions. Therefore 

r 

r—\ + Yi m i~ h — \. 
i 

The join is the whole of [%] if and only if k = n+ 1, that is, if and 

— Cg . . . e jq 1 . 

This case, in which all the elementary divisors are linear, will 
sometimes be referred to as the non-special case; the special case 
being that in which at least one of the elementary divisors is of 
degree greater than one. 
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The collineation (1) transforms any &-space 8 k into another 
lb-space S' k , this space consisting of all points y which are transforms 
of the points x of S k . The reasoning is exactly that used in the case 
of a non-singular projective correspondence between two distinct 
w-spaces. In particular, when k = n— 1, if S k is 

u = {u 0> ...,u n ), 

and S' k is v = (v 0 , ...,v n ), 

the correspondence is given by the equations 

v = Cu. (3) 

The self-corresponding primes of this collineation are obtained by 
finding primes w = (u> 0 , ...,w n ) such that 

pw = Cw (p + 0). 


We are thus led to consider the elementary divisors of the matrix 

C- A/„„. 


Since 


0- fieM 


o 

0 


0 

0 


0 


and since 

6e(*) 




I 


a 


-l 


■a 


or 


-2 


or 


— a~ 


0 

0 

<x~ ] 



\( — l) e_1 <x~ e ( — l) e-2 a -e+1 
it is easily calculated that the elementary divisors of C — A/ n+1 are 
(A-ar 1 )*!, (A-afc 1 )**. 


However, it will simplify our notation if we observe that a necessary 
and sufficient condition that Cw = pw (for some p=f=0) is that 
C'w = aw (for some a 4= 0). Hence we need only consider the united 
primes of the dual transformation 

u — C'v (4] 


HP I 


22 
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to obtain the united primes of the collineation (3). The elementary 


divisors of G' — A/ n+1 are evidently the same as those of G ■ 

that is .. 

(A CL j )% (A CLfc) 


■A/. 


n+l» 


Hence the following equations determine the united prime 
(w 0 , w n ) corresponding to the latent root a x : 


ajw,, = 

a l w x = w (x + a 1 w l . 


(5-1) 


= w e t -i + a l w ei-l>) 


<x l w e 1 = <*2™e t > 

*i w e 1+ e t -i = ^ ei+ts -2 + a 2 w ( , i+es _ 1) _ 


(5-2) 


a l M, e l +...+ej_i — a l w e 1 +...+et- 1 > 
a l w ei+...+et-l — w ei+...+et-2^~ a -l w tii+...+et-i> ■ 


(5-t) 


a l w e!+...+e< a t+l' lv e 1 +...+et> 


Ct x W n = ^n-l + a fe W n- 
From (5-1) we find, successively, that 

«V2 = °» W e x -2 = .... w 0 = 0, 

and then, from (5*2), (5-t), we find that, since a x = ... 

w ei +e t -2 = 0, ...,w ei = 0, 

w e x +...+et-i = •••> w e x +...+et-i = 

and finally that 

w i = 0 (i*te x + ... + e t ). 

Hence the united prime necessarily contains the points 


= a,. 


AAA A 

^ 0 > • • • > ei~2> ^ e x > • • • * ^ei+eg-Z* •••> 

and, conversely, any prime which contains these points satisfies (4). 
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These points determine a linear space 8 n _ t , and the primes through 
it form a star with vertex S n _ t . This star of primes is the dual of the 
set of united points in the space S t _ lt and the properties of the star 
may be determined by duality from the properties of the set of 
united points corresponding to the latent root cl x . 

Corresponding to each distinct latent root of C' there is a star of 
united primes, and we see that there is a rule, similar to Theorem I, 
for determining the star of united primes corresponding to a given 
latent root. 

Theorem III. If a t is any latent root of C (or C') and 

C'e^( cx q)> Qj +1 ( a s+l)> •••» ^'eq+ t ( a q+s) 

are the submatrices of C which correspond to the latent root a t of C, 
then the united primes corresponding to a t - form the star whose vertex 
is the join of the points 

Aq, . . ., 2 > •••» 

A A A A 

■ cl e 1 +...+e q+ ,-i> • ••> • cl ej+...+ej +< -2’ • ia e 1 +...+e J+ *> 

We have agreed to place C^af) before C e .(a,j) in C if a* = ocj and 
e t < ey, suppose that 

^q = ••• = ^q+t = f > ^q+l+ 1> •'•>^q+S' > f* 

Then, by Theorems I and III we see that the space S s of united 
points corresponding to the latent root and the vertex S n _ 8 _ x of 
the star of united primes arising from the same latent root intersect 
in the space 8 defined by the points 

A A 

+...+e ?+ <> •••* - rl e 1 +...+e ?+ ,_ 1 * 

Similarly we observe that the space of united points arising from 
any latent root (a^ 4= a*) lies in the vertex of the star corresponding 
to the latent root a t . Hence we have 

Theorem IV. The fundamental space of united points arising 
from any given latent root a lies in the vertex of the fundamental star 
corresponding to any other latent root, and meets the vertex of the star 
corresponding to the same latent root a in a space of p — 1 dimensions, 
where p is the number of non-linear elementary divisors of the type 
(A — a) e (e> 1). 

As an immediate corollary, we deduce that if a collineation is non- 
special the vertex of the fundamental star corresponding to any given 
latent root is the join of the fundamental spaces corresponding to the 
other latent roots. 


22-a 
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Now let jS^_ x and S n _ ( denote the fundamental space and the 
vertex of the fundamental star, respectively, corresponding to the 
latent root a x of C, and suppose that 

1 = e x = ... = 6 S < &g+i ^ c s +2 ^ ^ (6) 

Let x be any point of [»], let y = Cx, and 

z = y-a 1 x = (C-aJ^x. (7) 

Evidently z is collinear with x and y, and from (7) it is the trans- 
form of x under the collineation with matrix C— a x 7 n+1 . Since 
det (C — a x / n+1 ) = 0, this collineation is a singular collineation. 
The point z is uniquely defined by (7) unless 

Cx = a x x, 

that is, unless x lies in S t _ v If x is not in S t _ x then we see from (7) 
that the coordinates of z satisfy the equations 

z 0 = Z 1 = ••• = z s- 1 = z s+e , +x - 1 = ••• = z 8+e, + i„.+et-l ~ 

That is, z lies in S n _ t . 

It is also easily seen that if x * is any point of the space S ( which 
joins x to S t _ v the transform of x* by (7) is also z, and the corre- 
spondence between the i-spaces through 5 < _ 1 and the points z of 
S n _ t is a projective one. We prove this result, which should be 
compared with a similar result in § 1, using the symbolic notation 
for points. If the points 

P 0 >Pv • • • » Pi-i 

form a basis for S ( _ v then 

C(P i ) = a 1 P i (i = 0,...,f-l). 

Let P be any point not in S t _ v Then any point x* in the S t joining 
P to S t _ x is given by 

x* = 2 A iPi+jiP, 

o 

for suitable A t -, y in K. Now 

Cx* = 2 XcW+pCiF) 

o 

- «iSA iPi+yCiP), 

o 


and therefore 


z = Cx* — ^^* = y[C(P) — a x P], 


( 8 ) 
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which is independent of the particular x* chosen in the 8 t . If P is 
chosen in an S' n _ t which does not meet 8 ( _ v then P determines and 
is uniquely determined by 8 ( . Since 8 ( _ x is the join of the points 
A 0 ,A V ...,A e _ v A g ,A 8+es¥1 , ■■■,A s+et+i+ _ +et i , we may take P to lie 
in the S' n _ t spanned by the remaining vertices of the simplex of 
reference. The equations (8) determine a unique point z in S n _f if 
P is taken anywhere in S' n _ t . Conversely, if we write down these 
equations, assuming that z is in S n _ t , we see that we can solve for 
the coordinates of P, and that the solution is unique if we assume 
that P lies in S' n _ t . Hence the correspondence between f-spaces 
through S t _ x and the points z of S n _ t may be described as a projective 
correspondence. 

Now let us consider the effect of the original collineation (1), 

y = Cx, on the S t determined by P and the space of united points 

S ( _ v Since t -\ 

y = Cx * = a 1 'Z,X i P i + pC{P), 
o 

the points of S t are transformed into the points of the S' t determined 
by the transform of P under (1) and S t ( _ v Since 

z = y — a x x* = n[C(P) — a x P] 

gives a point in S n _ t which is independent of the point x* chosen in 
S t , it follows that the correspondence between S t and S' ( is a per- 
speetivity, vertex z. 

Now let us assume that in (6) we have s < t, so that the collineation 
(1) has non-linear elementary divisors corresponding to the latent 
root a v Then 8 t _ x and 8 n _ t have a space of t — s—l dimensions, 
say, in common. If z is in and S t is the unique <-space 

through Sf_i which corresponds to z in the projective transformation 
described above, the transform of S t under (1) is in perspective 
with S ( , vertex z. Since this transform contains S t _ v and z lies in 
the transform must coincide with S t . Algebraically, if 

z= (z 0 , ...,z n ), 

each z £ except 

^8> 2 8+e, +1 > •••> z s+e J+ i +...+e»_j 

is zero, and if we solve (8) for the coordinates (x 0 , ...,x n ) of P, not 
restricting P this time, we find that 

P = (*®0> ®1> •"> *^8— 1> ® 8 > ^8> • • • » *®8+ej+i» 

z s+e 8 +i> • • • » •••> a: 8+e (+1 +...+e»_ 1 > z s+e s+1 +...+et- l > • • • » ®)> 
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where each x { can be chosen arbitrarily. These points P fill the space 
8 t which corresponds to z. This expression for the coordinates of P 
makes it evident that the collineation (1) transforms P into a point 
with coordinates of the same form. Hence, again, S t is transformed 
into itself by (1). 

If we anticipate Theorem VI, we can say that the collineation (1) 
induces a collineation in S t . This induced collineation is such that 
8 t contains an 8 t \_ x of united points corresponding to the latent root 
a v The number of elementary divisors (A — af) f i must therefore be t. 

t 

Since also S/» = i+1, there being no united points of (1) in 8 t 
i 

outside 8 t _ v it follows that the elementary divisors of the induced 
collineation consist of t— 1 elementary divisors (A — a x ), and one 
quadratic elementary divisor (A — a x ) a . 

We return now to the consideration of (7), x being any point of 
[»] which is not in a fundamental space. Its transform by (1), that 
is, the point y = Gx, is distinct from x. We consider the p points 

y-ctiX (i=l,...,p), 

where p is the number of distinct latent roots of C. These are the 
points in which the line xy meets the vertices of the fundamental 
stars corresponding, respectively, to the latent roots a v ...,a p . 
This range of points, together with the points x and y, is projectively 
related to any other range of p + 2 points constructed from the 
collineation (1) in the same way [VI, § 10]. We remark that if the 
line xy lies in a vertex of a fundamental star, the points x and y = Cx 
must lie in the vertex. Conversely, if x lies in a vertex of a funda- 
mental star, say in that corresponding to the latent root <x v so that 

Xq — = ••• = &g_i == %s+e t+1 -l = ••• — a 'g+eg+i+...+e<-l = 

then 

y 0~ Vi ~ ••• = Vs~ 1 = 2/g+e, +1 -l ==•••— y s +e s+l +.,,+et-l ~ 

so that y also lies in the vertex. The vertices, in fact, are all united 
spaces under the collineation (1). This is also evident from the fact 
that any prime through a vertex is a united prime, and the inter- 
section of any number of united primes is necessarily a united 
space. Returning to the ranges of points discussed above, we have 

Theorem V. If any point x of [n] is transformed by the collineation 
y—Cx into a point y 4= a;, the line xy meets the vertex of each of the 
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fundamental stars of the collineation. If x does not lie in a vertex, let 
A x , ...,A p be the unique points of intersection of xy with the vertices. 
Then if y* is the transform by y — Cx of any other point x* which 
does not lie in a vertex, and y* =1= x*, and ifx*y* meets the fundamental 
vertices in A*, ...,A*, respectively, the ranges x, y, A u ...,A p , and 
x*,y*,A%, ..., A*, are related. 

We conclude this section by noting two simple results which will 
frequently be used in the sequel. 

Theorem VI. If a space S k is transformed into itself by a collinea- 
tion, the points of S k may be regarded as transformed by a collineation 
in S k . 

The property of a space S k being transformed into itself under 

the collineation (1) is invariant under any allowable change of 

coordinates. For let one of the linear equations satisfied by S k be, 

in matrix notation, , . ... 

u x = 0. (9) 

Then since the point y = Cx continues to he in S k , 

u'Cx = 0. (10) 


The allowable transformation of coordinates x — Px* transforms 
(9) into 


v'x* — 0, 

where v = P'u, and transforms (10) into 

v'P-WPx* = 0, 


(9)' 

(10)' 


which is just the condition that the prime given by (9)' should pass 
not only through x*, but through 

y* = p-^CPx*. 

But this is the transform of (1) under the transformation of co- 
ordinates x — Px*. Our initial result is therefore proved. 

We may therefore choose the coordinate system so that the 

equations of S k are _ n _ n 

x k+1 — u, ..., x n — o. 


Then if the collineation in this coordinate system is 


n 

y i = I,a ij x j (i = 0 n), 

j -« 0 

we have the equations 

n 

'2,a ij x ] = 0 (i = k+ 1, ...,n) 
i ”0 
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for every point (x Q> ...,x k , 0, 0) of S k ; that is, for all choices of 
x 0 , x k . This implies that 

a {) = 0 (i > k,j < k). 

The transformation in S k is given by the equations 

k 

y i = 'Z,a ij x j (i = 0, ...,k), 

)- o 

which is a collineation in S k . 

The only united points possessed by this induced collineation 
are the united points of the given collineation which lie in S k . 

Theorem VII. In any collineation there is at least one united 
point and one united prime. 

We need only prove the result for a united point, the result for 
a united prime following at once by duality. Since any collineation 
has at least one latent root, and since to each latent root there 
corresponds at least one united point, the theorem is evident. 
There are oollineations with exactly one united point. As an 
example, consider the collineation 

V = C n+i(a)*- 

The elementary divisors are (A — a) n+1 , and the only united point is 

( 1 , 0 ,. .., 0 ). 


4. The united ^-spaces of a non-singular collineation. A col- 
lineation ' j ,-Ax (1) 


induces a transformation, given by VII, § 2, (4), 




a 


iolo 


a 


ik io 


a Hh 

a ikh 


I Pio — ii 


( 2 ) 


on the Grassmann coordinates of the ^-spaces of [ri\, and we could 
find the united ^-spaces of the collineation (1) by considering the 
^-spaces which satisfy the equations 


. 2 

U> 


®ioU 


a io1k 


Pio — ik ~ <r Pio-.ik > 


a ikh • a ikik 
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remembering that the coordinates p io .„{ k satisfy the quadratic 
^-relations [VII, § 6, (4)] 

k 

Pi„...ikPjt — ik = ^‘Pio — it-ihi»+i — ikPjo — i\+iiah+i — jk‘ 

A -0 


This method of procedure is not, however, very convenient. One 
difficulty is that a qualitative description of the united points of 
the collineation (1) in [n] is not, in itself, sufficient to allow us to 
determine qualitatively the united points of (2), regarded as a 


collineation in a space of 
n = 3, and 



— 1 dimensions. For example, if 


/a 

0 

0 

°\ 

1° 

p 

0 

°l 

\° 

0 

r 

°) 

\o 

0 

0 

8 


where a, 0, y, 8 are all distinct, we have a complete qualitative 
description of the united points of ( 1 ) in the statement that there 
are four united points, at the vertices of a simplex. But if k — 1, we 
obtain equations (2) in the form 

q = Bp, 

where B is the second compound A (2) of the matrix A, and may be 
written, [VII, § 2], 

B = Ia0 0 0 0 0 0 \ 

I 0 ay 0 0 0 0 \ 

10 0 0 0 0 I 

I o o o fly o or 

l 0 0 0 0 08 0 ] 

\ 0 0 0 0 0 ySI 

The collineation in [5] has six united points at the vertices of a 
simplex, if the elements which appear on the principal diagonal of 
B are all distinct. But if, say, aft = y8, the collineation has an 
infinite number of united points. These do not all satisfy the 
quadratic p -relations, and therefore do not all represent united 
lines in [3]. 

We therefore use a different method to find the united ^-spaces of 
a collineation. We begin with the facts [§ 3, Ths. VI, VII] that if an 
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8 k is transformed into itself a collineation is induced in S k , and this 
has at least one united point, which is a united point of the original 
collineation. Thus we need only consider those ^-spaces which meet 
a fundamental space of the given collineation. 

Let P be any point of the fundamental space corresponding to 
the latent root <x v We seek the ^-spaces through P which are united 
in the collineation. Our search will be greatly simplified by the 
following theorem: 

Theorem I. If P is any united point of the collineation (1 ) an 
allowable coordinate system can be found in which the collineation is 
given in canonical form by the equations 

y = Cx, (3) 

where 

C= A7 ei K) 0 0 

| 0 C^(a 2 ) . 0 

\ 0 0 . C ek (a k ) 

and P is a vertex of the simplex of reference. 

As in § 3 we assume that 

a x = a 2 = ... = a t =t=ai ( i>t ), 

and that e i < e f if i <j ^ t. By the results of § 2, an allowable system 
of coordinates can be found in which the collineation is given in 
the canonical form (3). The fundamental space of united points 
corresponding to the latent root ct x is defined by the vertices 

A-o> •••> ■d-e 1 +...+et-i> 

of the simplex of reference. Let P lie in this fundamental space, 
so that 

P = A 0 A q + A x A ei + . . . + A /_ 1 -4 ei +...+e«_ 1 * 

\ 

Let us suppose that . _ . . . . 

A i = 0 (i<m-l), 

but A m-1 4= 0. 

We find a transformation of coordinates which leaves (3) unchanged, 
and transforms P to the vertex A* 1+ of the new simplex of 
reference. 
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The equations of the collineation (3) have been written out in 
detail in § 3, (2). We are now concerned only with the particular set 


"If Gx+t.'+Cfn—x ^ , tn X ei+,,,+Cjtt~l ,+6^i—i+l7 

yei+...+em— 1 ^77i‘*'e 1 +...+«m— 1> 


(4*m) 


^6j+...+6m ^771+1 ^61+. .. +6//J ^ ^'Ci+,„+Cfli+l> 

yei +...+ Cjn+i — 1 ^771+1 x tti +.,. +6771+1“ 1* 


(4*m-|- 1) 


y « x +...+«_! — a t x e 1 +...+et-i+ X ei+...+et- 1 +l> 

ye t +...+et-l ~ a l x ei+...+et- 1 * 


(*<) 


We perform a transformation of coordinates which affects only the 
coordinates involved in the above equations. We write 

x *i+...+e-i-i+i 

— ^•m—l x e 1 +...+ei-i+j ^■t-l x ei+...+e m -i+i 0" == • * * > e m ^)> 

and x ei+. ..+a-i+j ~ ^■m—l X e 1 +...+ei- 1 +j (j = ®i tti» •••* ^)> 

for each value of i from i == m + 1 to i = t. We perform the same 
transformation on the y’s. 

The equations of the collineation in this new allowable coordinate 
system are obtained by multiplying the equations (4-i) by A m _ x and 
subtracting from each of the first e m of them A f-1 times the corre- 
sponding equation of the set (4 -m). The other equations of the 
collineation are unaltered. It follows at once that the new equations 
of the collineation are just the original equations with x*, y* 
written for x, y, that is, 

y* = Cx*. 

It is immediately verified that in the new coordinate system P is 
the vertex A* 1+ _ +eml of the new simplex of reference. Thus our 
theorem is proved. 

We now assume that the united point P of the collineation (3) is 
the vertex A Ci+ +emi of the simplex of reference. Consider the line 
joining P to any point x of [n]. The collineation (3) transforms this 
line into the line joining P to the transform y — Cx of x by (3), 
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since P is a united point. The two lines we are considering meet the 
face of the simplex of reference 

x e x +...+em-i = ® 

in the points (x 0 , ..., x ei+ ... +em _ 1 _ 1 , 0, x £i+ ...,x n ), and 

(y 0 , ■~,y ei +...+e m - 1 - 1. o, y ex+ ...+e m - 1+ i> •••> y n )- If ( x o> ■■; x n -\) & nd 

(y 0 , y n _x) denote the coordinates of these points in the coordinate 
system whose simplex of reference contains the vertices which lie 
in the face x ex +...+e m - x = 0, namely 

Aq, . .., A ei+ '' A ei+ " ,4.e m _ 1 +x» •••> A w , 

the relation between x and y is 

V = Cx, (6) 

where G is the nxn matrix obtained from C as follows: 

(i) if e m = 1 we obtain C from C by omitting the one row and 
column of G^aJ; 

(ii) if e m > 1 we obtain C from C by replacing C e JaJ by C' em _ 1 (a m ), 
that is by omitting the (e x + ... + e m-1 + l)th row and column of C. 

It is clear that a united line through P is the join of P to a united 
point of the collineation (5) in the prime 

X e x +.. .+«m-i = 

and conversely. More generally, any united fc-space through P is 
the join of P to a united (k— l)-space of the collineation (5) in the 
face of the simplex of reference opposite P, and conversely. We thus 
have a means of constructing a theory of the united ^-spaces of a 
non-singular collineation, by induction on k. 

A formal statement, in all generality, of the theory of united 
^-spaces is somewhat complicated, although it does not involve 
any serious difficulties, either of an algebraic or of a geometrical 
nature. It is usually simple enough to obtain the desired results in 
any particular case. We shall confine ourselves here to a description 
of the properties of the united lines of [w]. This involves a study of 
the united points of the collineation (5). 

We have seen that in the collineation (3) each submatrix G^a^ 
corresponds to a vertex A ei+ _ +ei i of the simplex of reference which 
is a united point, and the fundamental space of united points corre- 
sponding to a latent root <x i is the join of the vertices corresponding 
to submatrices for which — cc All these corresponding 
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vertices except P = -4 ei+ ... +em _ 1 lie in * ei+ - +ewl _ l = 0. It is at once 
seen that if A h is the vertex associated with the submatrix Gg.(ctf) 
of C, where j #= m, then A h is also the vertex associated with the 
submatrix C^.(oq) of C. When we consider the vertex associated with 
C em (a m ) (remembering that a x = ... = a m = ... = a, t ) there are two 
cases: 

(i) if e m = 1 the matrix C em (<x m ) is simply struck out of C to form 
G, and there is no associated vertex for the collineation (5). 

(ii) if e m > 1 we form C from C by replacing G em (a m ) by G tm- 

and the vertex associated with this latter matrix is the point 

A 

e 1 +...+e m _ 1 +l* 

In case (i) the united points of (5) are just the united points of (3) 
which lie in x ei+ .„ +em _ x = 0, and a point associated with any latent 
root a of C is also associated with the latent root a of G. In case (ii) 
the united points associated with the latent roots of G different 
from a x are the same as those associated with the corresponding 
latent roots of G. But the united points of (5) associated with a x 
form the ( t — l)-space defined by the vertices 

-^o >A Cl , -4 e ,+...+e m _ 2 > A ei+ ' +ejn i+1 , A ei+ ' +em , A e 1 +...+e,_ 1 * 

Hence, recalling our original reason for investigating the united 
points of the collineation (5), we have 

Theorem II. If the vertex -4 ei+ ...+« m _ 1 °f the simplex of reference 
is associated with the submatrix C em (a m ) of C, the united lines of (3) 
which pass through A ei+ ^ +em i either join the point to another united 
point of (3) or, in the case in which e m > 1, they lie in the t-space which 
joins the fundamental space of united points of (3), which passes 
through ^ ei+ ... +em _ l , to the vertex A ei+ _ +em _ i+v 

If we consider the implications of this theorem for the case 
e m > 1, we see that the i-space joining the fundamental $ ( _ x corre- 
sponding to the latent root a x to the vertex ^4 Cl+ _ +em _ i+1 must be 
united. This is immediately verifiable from the equations of the 
collineation. Since in connection with this new result the vertex 
A e 1 +...+« m _ 1 + x plays no special part, we evidently obtain a more 
general theorem thus: Let 

1 = e x = ... — e s < e s +i ^ e s+ 2 ^ ... 

Then the S ( which joins 5 < _ x to any point in the space defined by 
the vertices 

-‘ l ^e 1 +...+e 1 +l> J ^e l +...+e l+1 +l> •••> ^e 1 +...+e<_ 1 +l 
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is united. This result should be compared with that in § 3, 
p. 341, which gave the coordinates of a point P in a united S ( 
through S (-1 as 

P = {%o> ®1> • 1> •*'«> Z s > 0, . . 0, 3Jg+ e4+1 , ^s+e t +i> • • •» 

* c s+e J+1 +. ..+et-i> z s+e, +x +...+e<_ x > 0, •••> ®)* 

The two results are the same. 

An immediate deduction from the above is 

Theorem III. If a collineation is non-special the only united 
k-spaces are those determined by k + 1 united points. 

Finally, we remark that if a collineation has a non-linear elementary 
divisor, then there is a united line which contains only one united point. 
With the notation used above, if s < t the line 

e x +...+ej ■^■ei+...+e a +l 

is united, and contains the one united point A e . >+e< . This result 
will be found useful on later occasions. 

Our discussion of the geometry of the united points and spaces 
of a non-singular collineation has made use of the indices e ly ...,e k , 
and of the equalities between the latent roots; the actual values of 
the latent roots do not enter into our results. The geometrical 
properties of the collineation are therefore sufficiently indicated if 
we give the numbers e v ...,e k (whose sum is n+1) and indicate 
which of them refer to equal latent roots. We do this by writing the 
Segre symbol for the collineation. This consists of the e v ...,e k 
written inside square brackets in any order, subject only to the 
condition that indices which refer to equal latent roots are put 
together, and enclosed in round brackets. Thus in [3] there are 
the following types of non-singular collineation: 

[ 1 , 1 , 1 , 1 ] [( 1 , 1 ), 1 , 1 ] [( 1 , 1 ), ( 1 , 1 )] [( 1 , 1 , 1 ), 1 ] [( 1 , 1 , 1 , 1 )], 
[1,1,2] [(1,2), 1] [(1,1), 2] [(1,1,2)], 

[2,2], [(2,2)], 

[1,3] [(1,3)], 

[ 4 ]. 

Writing ( x , y, z, t) instead of (x 0 , x v x 2 , x 9 ), and denoting the vertices 
of the simplex of reference by X, Y, Z, T, the canonical forms of 
these collineations are, respectively, as follows: 
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(i) x' = ax, y' = fly, z' = yz, t' = St. 

The vertices X, Y, Z, T are the only united points, their joins the 
only united lines, and the faces of the simplex are the only united 
planes. 

(ii) x' = ax, y' = ay, z' = yz, t' = St. 

The united points are the points of X Y, and the points Z, T. A 
united line joins any two united points, and a united plane is a plane 
containing three independent united points. In particular, there is 
a pencil of united planes through ZT. 

(iii) x' = ax, y' = ay, z' = yz, t' = yt. 

The united points he on the lines XY and ZT, and the united lines 
and planes are obtained as in (ii). In particular, there are pencils 
of united planes through XY and ZT. 

(iv) x' = ax, y' = ay, z' = az, t' = St. 

The united points fill the plane XYZ, and the point T is also united. 
United lines and planes are found as in (ii). In particular, any plane 
through T is united. 

(v) x' = ax, y' = ay, z' = az, t' = at. 

This is the identity transformation. 

(vi) x' = ax, y' = fly, z' = yz + 1, t' = yt. 

The united points are X, Y, Z\ the united lines are Y Z, ZX, X Y and 
also ZT. The united planes are Y ZT ,XZT ,XY Z. 

(vii) x' = ax, y' = ay + z, z' = az, t ' = St. 

The fundamental spaces are XY and T. United lines either join 
two united points, or join a point of ZX to Y . The planes of the 
star with vertex F T are all united, and the plane XYZ is also united. 

(viii) x' = ax, y' = ay, z' = yz + t, t' — yt. 

The united points are Z and the points of X Y; united lines either 
join two united points, or coincide with ZT. The united planes are 
the planes through ZT, and the plane XYZ. 

(ix) x' = ax, y' — ay, z' = az + t, t' — at. 

United points fill the plane XYZ\ the united lines either he in this 
plane or pass through Z. Any plane through Z is united. 

(x) x' = ax+y, y' — ay, z' = flz + t, t' — flt. 

The united points are X, Z\ the united lines are XY, XZ, ZT, and 
the united planes are XZT, XYZ. 
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(xi) x' = ocx + y, y' = ay, z' = az+t, t' = at. 

The united points form the fundamental space XZ\ the united 
lines join the points (a, 0, c, 0) and (0, a, 0, c). The united planes all 
pass through XZ. 

(xii) x' — ocx, y' = /ft/ + z, z' = fiz + t , t' = fit. 

The united points are X, T; the united lines XY and YZ, and the 
united planes YZT and XYZ. 

(xiii) x' = ocx, y' = ocy + z, z' = az + 1, t' = at. 

The united points form the fundamental space X Y ; the united lines 
he in XYZ and pass through Y. The united planes form a star 
having YZ as vertex. 

(xiv) x' = a x + y, y' — ay + z, z' = az + t, t' — at. 

X is the only united point, X Y the only united line, and XYZ the 
only united plane. 

5. Cyclic collineations. As in V, § 2, Th. I, we see that the non- 
singular collineations in \n\ form a group. We now ask: what 
elements, if any, of this group are of finite order? That is, we seek 
to determine the non-singular collineations 

y = Ax (1) 

which are such that for some integral value of m, y = A m x is the 
identity transformation. This is equivalent to the condition 

A m = kl (&* 0). 

When this condition is satisfied, (1) is said to be cyclic, of order m. 
The successive transforms of any point P of [n] form a 

finite set containing at most m points, since P m = P. When m = 2 
the colhneation (1) is often called an involutory transformation of 
the points of [n], and in the case n = 1 the pairs of points P, P x are 
said to be in involution on the line. 

W.e show that if (1) is cyclic the elementary divisors of A must be 
linear. We saw at the end of § 4 that if A has a non-linear elementary 
divisor there exists a united line which contains only one united 
point. The collineation induced on a united line must also be cyclic, 
and its order must be a factor of m. If we choose the coordinate 
system so that the line is given by the equations 

x 2 = 0, ...,x n = 0, 

we can also assume that the induced collineation is in canonical 
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form. Sinoe there is only one united point on the line the canonical 
form must be 

y 0 = ax 0 + x v 
Hi = ocx lt 

corresponding to a non-linear elementary divisor. If we apply this 
collineation m times to (x 0 , xf) we obtain the collineation 

y 0 = a m x 0 + ma m - 1 x 1 , 
y x = a m x v 

This is the identical transformation if and only if 

ma* _1 = 0. 

Since the ground field K is without characteristic, and a + 0, this 
cannot happen. Therefore, if (1) is cyclic its elementary divisors are 
necessarily linear. 

We now write the collineation (1) in canonical form, which is 
y = /ctj 0 0 . 0 \ x, 

I 0 a 2 0 . 0 1 

\ 0 0 0 . ct n+ J 

since the elementary divisors are all linear. The condition that (1) 
be cyclic of order m is now easily seen to be 

af = k ( i = 1, ...,«,+ 1) 

for some non-zero k. This result may be stated as 

Theorem I. A necessary and sufficient condition that the collineation 

y = Ax 

be cyclic of order m is that the elementary divisors of A — A/ n+1 be linear, 
and the latent roots of A satisfy the equation 

x m = k, 

where k is a non-zero dement of the ground field. 

The condition can also be expressed in terms of invariant factors: 

Theorem II. A necessary and sufficient condition that the col- 
lineation A 

y =■= Ax 

be cyclic of order m is that the invariant factors of A —M n+1 have no 
repeated factors and that each invariant factor be a factor of the poly- 
nomial A m — k, where k is some non-zero element of the ground field. 


HP I 


33 
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This last theorem also holds when the ground field K is not 
algebraically closed. Indeed, let A be a non-singular matrix over 
any field K without characteristic, and let us suppose that the 
collineation (1) is cyclic of order m. Then 

A m = kl n+1 (k in K). 


Now let K' be an algebraic extension of K over which the character- 
istic polynomial of A is completely reducible. Each invariant factor 
of A — A/ n+1 is then completely reducible. The reasoning used in 
proving § 2, Th. Ill, shows that over K' the collineation can be 
reduced to the canonical form used in the case of algebraically 
closed fields. The reasoning given above then shows that the 
conditions given in Theorem II are necessary and sufficient for 
the equality 


A m = kl n 


+i> 


k being in K . 


6. Some particular collineations. In this section we consider 
some particular collineations which are of geometrical interest, 
confining ourselves to the case in which the ground field K is 
algebraically closed. Some of the results we shall obtain are par- 
ticular cases of theorems proved in § 3, in particular of Theorem V 
of that section. 

We first consider the non-special collineation with only two 
fundamental spaces S a and S b of united points. Since the collineation 
is non-special, we have [§ 3, Th. II] 

a + b+1 = n. 

The Segre symbol for the collineation is 

[( 1 , 1 ,..., 1 ),( 1 , 1 ,..., 1 )], 

with a + 1 units in one bracket, b + 1 in the other. The collineation 
can be written, in canonical form, as 


Ho = a*o> 


Vi = kjc v 


Va = <**«. 

Va + 1 - P x a+\> 
• • • 

Vn = fan* 
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where a =}=/?. The line joining any point x which is not in S a or 8 b 
to its transform y meets 8 a in the point z = y— fix, and S b in the 
point t = y— ax. If x* is any other point of [to] not in a united space, 
and y*, z*, t* are constructed as above, we see that the ranges 
(x, y, z, t) and (x*, y*, z*, t*) are related. 

We can construct this collineation geometrically, as follows. We 
assume that we are given the fundamental spaces 8 a and 8 b , that 
these do not meet, and that we are also given one pair P, Q of corre- 
sponding points which are not united points. Let PQ meet S a in 
R and 8 b in S. We find the point B corresponding to a given point A 
as follows. If A is in either united space, B = A. If this is not the 
case the join of A to S a is a space of a + 1 dimensions, B a+V say. 
Since a + 1 — n — Z a+ i meets S b in a point, at least. If ^a + 1 
met S b in a space 8 g (c > 0), then S c , lying in £ a +v would meet S a , 
which also lies in ^a+V Hence S a would meet S b , contrary to hypo- 
thesis. Hence ^•a+l and S b have only one point in common. Let this 
be D. Then AD, being a line in B a+1 , meets S a , in C, say. Hence A CD 
is a line through A which meets both S a and S b . It is the only line 
of this kind. For if AC'D' is another, C' in S a , and D' in S b , it con- 
tains two points A and C' of B a+1 , and therefore lies in Z a +v Hence 
D' is common to Z a+1 and S b . Therefore D' = D, and ACD = AC’D’. 

On ACD there is a unique point B such that 

(A,B, C,D) A (P, Q, R, S). 

This is the required point B. 

Since the collineation is non-special, the united ^-spaces of the 
correspondence are the spaces determined by k+1 united points. 
There are no others [§4, Th. III]. Hence they join an S a . in S a to 
an S b > in S b , where 

a' + b' + l = k (a' ^0,6' >0), 

or they lie in 8 a or in S b . In particular, any prime through S b (S a ) 
joins S b (S a ) to an S a _ 1 (»S 6 _ 1 ) in 8 a (S b ) and is therefore united. The 
spaces 8 b and 8 a are the vertices of the stars of united primes, as 
is evident from the dual transformation. 

A particular case is that in which a + /? = 0. The collineation is 
then cyclic, of order 2. If P is transformed into Q, Q is transformed 
into P, and, with the notation used above, (P, Q, R, S) is a harmonic 
range. This is the involutory case. 

Another special case of interest is that in which o = 0, b = n — 1. 
Our construction for the point corresponding to a given point 


«3-a 
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becomes very simple. The collineation in this case is called an 
homology, and the special case of homology, when ot+/3 = 0, is 
called involutory homology. 

We now turn to certain collineations in which there is only one 
fundamental space of united points, confining ourselves to those 
whose Segre symbol is [(1, 1, ..., 1, 2)]. We first consider the case in 
which n = 1. The equations are 

y 0 = ax 0 + x lt 
Vi = <xx v 

This is the same collineation, from the projective point of view, as 

y 0 = Xo + a- 1 *!, 

Vx = z x> 

and since the elementary divisor of this last collineation is (A — l) 2 
it follows that any collineation on a line with Segre symbol [2] can be 
written in the form 

Vo = x o + z i> 

Vx = z x> 

by a suitable choice of coordinate system. Hence these collineations 
are all protectively equivalent. 

Conversely, if we are given a collineation on a line with one united 
point, X, say, and a pair of corresponding points P, Q, a trans- 
formation of the above type is determined. We assume that X, P, Q 
are distinct. There exists a transformation of coordinates such that 
in the new coordinate system X = (1,0), P = (1,1) and Q = (2, 1). 
Let the collineation be given in this coordinate system by the 
equations y 0 = ccx 0 +px 1 , y 1 = yx 0 +8x v 

Then, since (1,0) is a united point, y = 0. Since also it is the only 
united point, there is only one elementary divisor, of degree two. 
Therefore 8 = a. Finally, since (1,1) is transformed into (2, 1), we 
have <x+fi — 28, and therefore 

a = ft = 8. 

The collineation is therefore projectively equivalent to 

y 0 = x i) + x v 
Vx = *i- 

Such a collineation is sometimes called a displacement on the line, 
its equation in non-homogeneous coordinates being 

y = x+l. 
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The collineation in [»] of type [(1, 1, ..., 1, 2)] has the equations 

Vo = ccx o, 

Vi = <xx v 

Vi i—2 = 2> 

Vn-1 = O^n-l + ^n. 
y* = aX n- 

The united points form the fundamental space 

x n = 0. 

If the equations of any united S k are 

n 

£ a n x j = 0 (i = 0 , ...,n — k—l), 

j =o 

the equations (1) show that 

n 

aEa«3 # +a in _iX n = 0 (i = 0, ...,n-k- 1) 

i =o 

for every point (« 0 , ...,x n ) of S k . If x n — 0 for every point of 8 k then 
S k lies in the fundamental space of (1). If x n is not zero for every 
point of S k we must have 

a in-i = ° (* = 0, 

if S k is united. Hence S k passes through the point 

^ = (0,0,. ..,0,1,0). 

Conversely, the equations show that if S k passes through this point 
it is united [cf. § 4], 

From the equations (1) it is evident that if P is transformed by 
the collineation into Q, 

Q = <xP+x n A, 

so that A, P, Q are collinear. If P is not a united point, so that 
A, P, Q are distinct, there is a collineation induced on the line. This 
has only one united point, the point A . Hence it is the displacement 
having A as united point which transforms P into Q. The displace- 
ment is thereby defined. 

Now consider any S n _ 2 which lies in the fundamental space, but 
does not pass through A . Any prime through S n _ 2 is transformed into 
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another prime through S n _ a . In particular, the join of P and S n _ t 
is transformed into the join of Q and S n _ a . Let any line through A 
meet the first prime in P' and the second in Q' . The collineation (1) 
transforms P' into the point on AP' which lies in the second prime, 
that is, into Q'. Thus, having chosen an S n _ 2 in the united space, and 
being given one pair of corresponding points P, Q, we can find a 
pair of corresponding points P', Q' on each line through A, and 
these determine the displacement on the line. The transformation ( 1 ) 
then transforms any point R into the point S into which R is 
transformed by the displacement on AR. 


7. Singular collineations. We conclude this chapter with a 
brief account of the singular collineations in [to]. The reasoning of 
§2 shows that, if the ground field K be algebraically closed, the 
coordinate system can be chosen so that the collineation may be 
written in the canonical form 




so that the submatrices corresponding to zero latent roots are 
together at the end. 

The discussion of united points given in § 3 is valid, subject only 
to the modification that the fundamental space S k _ h ^ l corre- 
sponding to the latent root zero is now the space of points which have 
no transform; but the geometrical relation of this to the other 
fundamental spaces is as before. 

Again, it is clear from (1) that the points of [to] which have a 
transform are all transformed into points in the space S n _ k+h 
whose equations are 

x e 1 +...+e/,+i-l = x ei+...+eh+t~l = •••> x n ~ ( 2 ) 

In the case in which 

6 h+l = e h+2 ~ ••• = e k = L 

the spaces $ fc _ A _ x and S n _ k+h have no points in common, and their 
join is [to]. The description of the collineation is particularly simple 
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in this case. Just as in § 3 we see that there is a projective corre- 
spondence between the (k — A)-spaces through <S' fc _ h _ 1 and the 
points of S n _ k+h . The collineation (1) may then be described as 
follows: 

A point x in <8 , fc _ h _ 1 has no transform. If x does not lie in S k _ h _ x 
it is projected from 8 k _ h _ 1 into the point 

^ = (*o> • • • > x n—k+fi> ...» 0) 

Of S n —k+h‘ ^n-k+h consider the non-singular collineation 

y*= /c ei (*i) o • o \ **, 

I 0 C et (cc 2 ) . 0 \ 

V 0 0 C eh {a h )/ 

where y* = (y 0 , ■■■,y n ~ k+h ), and z* = (x 0 , ...,x n _ k+h ). The point 

y = (y*,o,...,o) 

of 8 n _ k+k obtained from the transform of z* is evidently the trans- 
form of x under the collineation (1). 

If the indices e h+v ...,e k are not all equal to unity, the construc- 
tion of (1) by a series of non -singular collineations and projections 
is not quite so simple, because 8 k _ h _ x and 8 n _ k+h have points in 
common. We may, however, proceed thus: 

Let S n _ k+h be the space given by 

x e 1 +... +e/, ~ Q> x ei+...+eh+i = •••» ^ej+.-.+et-i = 

This has no points in common with the space S k _ h _ lt which is 
spanned by the vertices 

A A A 

+...+«*> •"«!+.. .+eA+x» • • • > A «!+...+«*_!• 

We project any point x not in 8 k _ k _ x from 8 k _ h _ x into a point x^ of 
Z n -k+h- Then 

_ _ - A 

/ ^ 

•^(l) = ( x 0> •"> ■*'«!+.. .+«*— 1> X e\+...+eh+l’ •••> •*'ex+.. .+«A+i—l> 

x e 1 +...+eh+i+l> '••)> 

the zeros corresponding to the first rows of the submatrices 

0),...,C ek (0) (3) 

in (1). Since we wish to consider a collineation in Z n -. k+k , we omit 
the zero coordinates in %>, and write 

*( 1 ) = ( x o> • • • > x ex+...+eh~l’ x e 1 +...+ef ) +l’ •••)• 
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We now perform the non-singular collineation in Z n _ k+h , 



where 4 , 
matrix, 


, I e . are each unity matrices, and D e is the (cyclic) exe 


D< = 


10 1 0 
0 0 1 


0 . 
1 0 


The effect of the matrices D e is to displace the first to the last 
coordinate in each batch of coordinates in xfc which corresponds to 
a submatrix (3), so that 

^(2) = (^O* • • • > x e 1 +...+eh-V x e 1 +...+eh+2i • x e l +...+eh+ 1 -l> x e 1 +...+eh+l> •••)• 

Now consider the ( k — h — l)-space whose equations are 

X 0 = x i= ... = x ei +...+eh-l = 
x ei+...+eh+l = ••• — 3 'e 1 +...+ey 1+1 — 2 = 0, 
a: «! x +...+eA +1 +l = ••• = x e l +...+eh+ v -2 = 0 , 


x ei+...+et-i+l = ••• = x n- 1 ~ 0 * 

x e t +...+e/, + x e 1 +...+e/ t+1 -l = °> 

x e 3 +...+e/ t +i x ei+...+eh+t~l 


x e l +...+e k - l + x n = 

This space evidently does not meet the S n _ k+h given by (2). If we 
project the point xfa in Z n _ k+h from 2T fc _ A _ 1 on to S n _ k+h (noting that 
Z n _ k+h does not meet Z k _ h _ 1 ) we obtain the point 

* s 

%) = • • • > x ei+...+eh~l> x ei+...+e k +V • • • > x ei+...+eh+i~V^> •••)> 

f If z is the point in which is collinear with sc*, in .£„_*+* and 

X( 8 ) in S n _ k+h , we see that z = x (3) -x%. 
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where in each batch of coordinates corresponding to a submatrix (3) 
the order is standard. Since' %) is in S n _ k+h , the coordinate corre- 
sponding to the last row of each submatrix is absent. Hence, if 
we omit the zero coordinates in X( 3) , and write 

~ (* 0 > • • ■ ’ •*'e 1 +...+e*+l> •••> a 'e l +...+eA + 1 -l> •••)» 

the non-singular collineation in S n _ k+h given by 



transforms x* 3 * into the transform of x by the collineation (1). 

It will be noticed that in each of the various projections and 
collineations we have used above, a united point of (1) remains a 
united point. For the coordinates of a united point are of the form 

(x 0 , X k , ...» %e 1 +...+e/ t —l> •••> 0 )> 

and are therefore unaffected. 

Finally, the Segre symbol can be extended to denote singular 
collineations. This is usually done by drawing a line above the 
indices e { in the symbol which correspond to a zero latent root. It 
should be noted that there is only one line, and this covers a single 
index, or a single set of indices in a round bracket. Thus the singular 
collineations in [3] are, apart from the improper one with zero 
matrix, y = 0, 

[1.1. 1.1] [(1,1), l,T] [(M),i,i] [(i,l)(M)] [(l,l,l),T] [(OT),1], 

[1.1.2] [1,1,2] [(M), 2] [(1,1), 2], 

[I, (1,2)] [ 1 ,( 172 )] [(fX2)], 

[2.2] [M], 

[3.1] [3,1] [(M) ], 

W- 
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CORRELATIONS 

In this chapter we assume once more that the ground field K is 
commutative and without characteristic. We shall indicate in the 
various sections whether K is also assumed to be algebraically closed. 

1. Correlations. Let S m , 8 n be two projective spaces of dimen- 
sions m and n respectively over K. In each of them choose an allow- 
able coordinate system. Then, if 

A = (a {J ) 

is an (m + 1) x (n+ I) matrix over K, the equations 

n 

v i ='Za ij x j (i = 0,...,m), 

j =o 

that is, in matrix notation, 

v — Ax, (1) 

determine a prime v, of equation 

v 0 y 0 +...+v m y m = 0, 

in S m corresponding to any point x of S n which does not satisfy the 
equations &x _ # 

If A is of rank r, the exceptional points in S n , that is, those with no 
transform under (1), form a space of n — r dimensions. The primes 
in 8 m corresponding to non-exceptional points of S n form a star of 
dimension r — 1, and hence all pass through a space of m — r dimen- 
sions in S m . We denote these spaces oi n — r dimensions in S n and 
m — r dimensions in S m by the symbols S n _ r , S m _ r respectively. 

If P and Q are any non-singular matrices, of orders 1, n+l and 
m + 1 respectively, over K, the equations 

x - Px*, y = Qy* 

determine allowable transformations of coordinates in 8 n , S m . The 
f An (»+ 1) x (»+ 1) matrix is sometimes said to be of order n+l. 
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corresponding transformations of the prime coordinates in these 
spaces are given by the equations 

u* = P'u, and v* = Q'v, 

and (1) becomes ^ = q APx * 

in the new coordinate systems. 

Since A is assumed to be of rank r, we can determine P, Q so that 


Q'AP = 



The relation (1) now becomes 


„* = £* (i = 0,...,r-l), 
vf = 0 (i r) 

in the new coordinate systems. The space S n _,. is given by 

a;* — ••• = %?-i = 0, 


and S m ^. by 


y* = ••• = yf-i = o. 


By an argument similar to that used in VIII, § 1, we can show that 
the points of any (n — r + 1 )-space through S n _ r all correspond to 
the same prime through S m _ r , and there is a one-to-one corre- 
spondence between these (n — r+ l)-spaces through 8 n _ r and the 
primes through S m _ r . 

Now choose two (r— l)-spaces, <S r-1 in S n and in S m , which 
have no intersection with S n _ r and S m _ r respectively. Then there is 
clearly a correspondence between the points of (S r _ x and the primes 
((r — 2)-spaces) of S ' r _ x , in which a point P of 8 r _ x corresponds to the 
intersection of 8 with that prime of S m through 8 m _ r which 
corresponds to P and to the join of P to S n _ r . 

The properties of the correspondence (1) can evidently be 
obtained at once from the properties of the point-prime corre- 
spondence between /S r _! and S' r _ v Hence we need only consider 
that case of (1) in which r = ra+ 1 = n+ 1. 

We then have a non-singular correlative correspondence between 
two distinct spaces of equal dimension, 8 n and S m . If the allowable 
coordinate systems in the two spaces are suitably chosen, the 
correspondence is given by the equations 

v i = x i (i = 0 , ..., to = n). 
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Linearly independent (dependent) points of S n give rise to linearly 
independent (dependent) primes of S m . Hence the points of S n 
which lie in a linear space S k are transformed into the primes of S m 
which pass through an S m _ k _ v If (.. M p lo ti , ...) are the Grassmann 
coordinates of S k it is evident that the coordinates of are 

...), where 

qio-h = p Uii . 

Indeed we can pass from S n to S m by the projective transformation 

Vi = x l (i = 0,...,m), 

followed by the dual transformation in S m itself. The enumeration 
of the properties of this correspondence is a simple matter, and 
need not detain us here. 

In this chapter we are more concerned with the case in which the 
two spaces S m and S n coincide in a single space, which we denote 
by [n]. We choose an allowable coordinate system in [»], and 
consider the equations 

n 

n i = ^a ij x j (i = 0,...,»), (2) 

i - o 

that is, u = Ax, 

in matrix notation. These equations relate a point x of [n], which 
does not lie in the space whose equations are 

Ax = 0, 

to a prime u whose equation is 

u 0 x 0 + ... +u n x n = 0. 

Such a relation is called a correlation in [n]. If 

x= Py 

is an allowable transformation in [»], the induced transformation 
of the prime coordinates is 

u = Pv, 

where P is the complement of P. Equations (2) then become 

v = P’u 
= P'Ax 
= P'APy. 
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One of the main objects of this chapter is to obtain canonical 
forms for sets of equivalent matrices A, B , ..., equivalent under 
the relationship ^ _ p>^p 


where P is non-singular. This will enable us to write the equations 
of any given correlation in canonical form, so that we can examine 
its properties. But before doing this we examine certain preliminary 
notions concerning correlations which, besides being of interest in 
themselves, will be useful in our discussion of equivalent correla- 
tions. 

The correlation (2) sets up a correspondence between points x 
and y of [»] in which there corresponds to any point x all the points 
y of the prime u. The point y corresponds to a; if and only if y'u = 0, 
that is, if and only if y ’ Ax = a . (3 ) 

The equation (3) is said to determine a bilinear correspondence 
between x and y, and the form y' Ax, which is linear in the inde- 
terminates x and y, is called a bilinear form. From (3) we can 
always obtain (2). Since we can also write (3) as 

x'A'y = 0, 

we see that if y corresponds to a: in the bilinear correspondence 
arising from (2), then x corresponds to y in the dual correlation 


v = A'y. 


(4) 


Hence the points x whose associated primes under (2) contain a 
given point y lie in the prime v given by (4). 

The correlation (2) and the dual correlation (4) suggest another 
correspondence, between points which give the same prime in 

(2) and (4), Ay _ ^ (6) 

This correspondence can be interpreted geometrically as follows: 
if z is any point corresponding to x in the bilinear correspondence 

(3) , then z' Ax — 0. Using (5), this becomes 

z' A'y — 0, or y'Az = 0, 

which shows that y corresponds to z in (3). In the case of a non- 
singular correlation, that is, when A is non-singular, (5) is the 
collineation y _ Aa% . (6) 

we call this the associated collineation. 
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If we transform the coordinates by the equations 

x — Px*, 

writing B - P'AP, 

(2), (3), (4) and (5) become, respectively, 

u* = Bx*, 
y*’Bx* = 0, 
v* = B’y*, 

'and B'y* = Bx*. 

Hence the bilinear correspondence, dual correlation and, when defined, 
the associated collineation are invariantly related to the given correlation. 

The study of the correlation (2) will be greatly simplified if, at 
the same time, we consider the associated correspondences given 
by (3), (4) and (5). We begin by considering the special case in which 
the correlation (2) is identical with the dual correlation (4); that is 

A'x = pAx 

for all points x in [n], p being some non-zero element of K. Then 

A' = pA. 

Forming the transpose of these equations, we obtain 

A = pA'. 

Hence A — p 2 A. 

Therefore p = ± 1. 

When p = 1, so that A' = A, the matrix A is a symmetric matrix. 
The correlation (2) is then called a polarity. When p = — 1 , so that 
A' — — A, and A is skew-symmetric, (2) is called a null-polarity. 
We shall find it convenient to consider the properties of polarities 
and null-polarities in some detail before considering the properties 
of correlations in general. 

We conclude this introductory section by remarking that we 
may consider, if we wish, a correspondence between two (possibly 
identical) spaces S n and S m in which primes in S n determine points 
in S m by means of the equations 

n 

yi = 1,b ij u j (i — 0, ...,m), 

i-o 

y = Bu. 


or 


( 7 ) 
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It is clear that the properties of such a prime-point transformation 
can be obtained from the properties of the point-prime trans- 
formation we are discussing, and it is not necessary to discuss 
prime-point transformations separately. We note, however, that 
when m = n and both A and B are non-singular, the point-prime 
transformation (1) determines the prime-point transformation 

x = A~ x v 


from 8 m to 8 n , and the prime-point transformation (7) determines 
a point-prime transformation 


from S m to S n . 


u = B~hf 


2. Polarities. Let u — Ax (1) 

be the equations of a polarity in an allowable coordinate system 
in \n\. The bilinear correspondence which arises from (1) is 

y' Ax = x' Ay — 0. 

Any two points £, y which are specialisations of x, y such that 

y'Ai = Z'Ar, = 0 

are said to be conjugate points in the polarity (1); if a point is 
conjugate to itself, it is self-conjugate in the polarity. If £, y are 
conjugate points, the prime A£ corresponding to £ contains y, and 
the prime Ay corresponding to y contains £. 

We begin our study of the polarity (1) by obtaining an allowable 
coordinate system in [ri\ in which the equations of the polarity can 
be written in a standard form. There are several methods by which 
we may proceed. The one we shall follow is one which will prove 
useful later in this chapter. Our method involves the bilinear form 

n n 

y'Ax^^ T,a ij y i x j (2) 

in the two sets of indeterminates ( x 0 , ...,x n ) and (y 0 , ...,y H ), and only 
uses transformations of coordinates of a very restricted type. We 
shall, for the purpose of the present reduction, say that an allowable 
transformation of coordinates in [w] given by 

x = Px* 

is of restricted type if the elements of the matrix P above the prin- 
cipal diagonal are all zero. Using the rules given in Chapter II for 
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constructing products and inverses of matrices, we can prove at 
once that the product of two transformations of restricted type is 
also a transformation of restricted type, and the inverse of a (non- 
singular) transformation of restricted type is also a transformation 
of restricted type. We now prove 

Theorem I. By a suitable transformation 
x =» Px*, y = Py * 

of restricted type, the bilinear form y' Ax can be reduced to the form 


(y% x Z + 2/<lO + (« + yto + ... + + yt-i x t-t) 

+ C irVtr X iir yt x l> 

where (i 0 , ..., i n ) is a derangement of (0, ...,n), r is a suitable integer 
such that 0 < r < \{n + 1 ), and c 2r , . . . , c n are in K. 

There is nothing to prove if n = 0, and we therefore proceed by 
induction on n, assuming that the theorem is true for bilinear forms 
in the indeterminates ( x 0 , ...,x m ) and (y 0 , ...,y m ), where m<n. 

Case (i). If a in = a ni = 0 for all values of i, 

n— 1 n— 1 

y'Ax = S 2 auVi^. 
i-oi-o 

By hypothesis there exists a transformation 

n — 1 n — 1 

x i = 2 quip y t = 2 q^Vj (* = o, ..., n- 1 ) 

j « 0 j =0 

of restricted type which reduces the bilinear form 

71—1 71—1 

2 2 dijViXj 

i=o ; -o 

to the form required by the theorem. Then the transformation 

x = Px*, y = Py*, 

where 
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is of restricted type, and reduces y' Ax to the required form, with 
i n = n, and c n = 0. 

Case ( ii). a nn =f= 0. We may write 

n n 

y' Ax = S 2 

1=0 j*0 

( n \ / n \ n-1 n-1 

2 ^ inVil I 2 2 2 ^nn 

i= 0 / \i = 0 / i-0 j =0 

Since a in = a ni , the transformations 

* = Q^C. y = Q~ x y, 

where 



are of restricted type, and reduce y' Ax to 

n— 1 n— 1 
i=*0 j=0 

where c n = a~*, and b it = a;iKn a ir“inS’)- By the hypothesis 
of induction we can perform a restricted transformation on 
(Co. •••» Cn— i) and 07o. •••.i/n-i) so ^at 

71—1 71 — 1 

1=0 j =0 

is a form of the kind required by the theorem and then, just as in 
Case (i), we deduce a transformation of restricted type which reduces 

71—1 71—1 

2 2 bijViisj "t" ^nVn^n 

i= 0 0 

to the required form. Since the product of two transformations 
of restricted type is also of restricted type, the form y' Ax is 
reduced to the required form by a transformation of restricted 
type. 
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Case (in). a nn = 0, a in = a ni + 0 for some values of i. Let k be 
the largest value of i for which a in 4= 0. Then 

( k \ / k \ n— 1 n— 1 

S a inVi ) x n + 2 2 

i =0 / \i — 0 / i-0*-0 

71—1 71— 1 

= 2/n£fc + *n1?fc+ 2 2 a iiVi x ii 

i-0 j =0 


where 


£* = 




2 

i-0 


From this last equation, we have 

fc-i 

= - 2 anfc«„i^ + «n^fc- 
i=0 

Replacing by this expression in the indeterminates x 0 , . . . , 
and making a similar substitution for y k , we find that 

Ti—l n— 1 

i -0 y-o 

71—1 71—1 

= y n £k + X nVk+ 2 ' a kjVkXj+ 2 ' 

?=0 i-0 

71 — 1 71—1 

+«fcfc2/fc«fc+ 2' 2 

i-o y-o 

= ( 2'&<</i + */n) Cfe + 5/fc ( 2'Mi + *n) +«fcfc«n!4^fc 

+ 2' 2' bijViXj (b i} = bji), 

i — 0 ji — 0 

where the 6 i; b i} are elements of K easily determined by direct 
substitution. Now, the transformation 

k 

ik = 2 a ni x i> 

i-0 
/ yh^ m 1 

£n = 2' b i x i + x n + la kk a~l £ ®n<*i> 

i-0 i-0 

^ = ^ (t = 0, • , , y Jc l,A/“f'l,...,?2' 1), 


71—1 

+ In the summations S' the value i = k is omitted. 
i-0 
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is of restricted type, and therefore its inverse is of restricted type. 
The form above becomes 

Vn£k + Vk£»+ S' S' biiViij 

1 = 0 1 = 0 

under the inverse transformation. We now apply our hypothesis 

n— 1 n— 1 

of induction to S' S' This is reduced to the form required 

1=0 ;= 0 

in the theorem by a transformation of restricted type on 

(£o> • • • > £k—l> £k+ 1> • • • > £n— l)* 

If the new indeterminates are 

(?o> •••» £k— 1> ?fe+i> •••»?»»— 1)» 

and we add to the equations of transformation the two equations 

St = St. 

s„ = s, 

in their correct position, the new matrix is still of restricted type, and 
we have reduced y' Ax by a sequence of restricted transformations 
to the required form. Thus our theorem is proved. 

The use of transformations of restricted type is of purely auxiliary 
importance, and when we are concerned with only one symmetric 
bilinear form y' Ax we can obtain a simpler standard form by using 
allowable transformations which are not of the restricted type. 
We first carry out the transformation 

_ p* . r* 

~ 5; 0 ' fetv 

_ p* _ p* 

• c l l — b i 0 bii> 

• • • 

4- r* 

t 2 r -2 r-a ‘ 
a;* = £* —A* 

= (j = ^2r> • • • > ^n) 

on the standard form obtained in Theorem I. It becomes 

Z’b’ttd 

j-0 

= 2 (i = 0, ...,r— 1), 

<> 21+1 = - 2 (* = 0, ...,r— 1). 


where 

and 


a*-a 
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Remembering that (i 0 ,...,i n ) is a permutation of (0, ...,n), we 
finally use a transformation which merely amounts to a permutation 
of the indeterminates and we have 

y' Ax = 'Zd t Y i X u 

i-0 

where d 0 , ...,d s are not zero. If the resultant of all the transforma- 
tions used is the transformation 

x= RX, y = RY, 

then 



and therefore the rank of A is s + 1 . Hence we have 

Theorem II. (a) By a suitable choice of the allowable coordinate 
system any symmetric bilinear form y' Ax can be reduced to 

i= o 

where s + 1 is the rank of A . 

(b) By a suitable choice of an allowable coordinate system the 
polarity u = Ax can be reduced to the equations 

U { = diX { (i = 0,...,n), 

where d 0 ,...,d s are not zero, but d s+1 = ... = d n = 0, the rank of A 
being s+ 1. 

When the ground field K is algebraically closed we may use 
a further transformation 

X'f = d\Xi (i — 0, ...,s), X * = X{ {i = s+ 1, ...,n). 

The effect of this is to replace each d { (i < s) in the above Theorem 
by 1. Hence we have 

Theorem III. If the ground field K is algebraically closed, the 
only projective invariant of a polarity is its rank. By a suitable choice 
of the allowable coordinate system the equations of a polarity of rank 
s + 1 may be taken as 

u i = x i (i = 0,...,s), U{ = 0 (» = «+!,...,»). 
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The system of allowable coordinates which reduces a polarity to 
the form given in Theorem III is not uniquely determined, as the 
following discussion will show. We assume that the matrix A in (1) 
is of rank s + 1, and that O 0. Let 




be any point which is not 
(1). Then 

The equation 


self-conjugate with respect to the polarity 

(3) 

*«»' Ax = 0 (4) 


determines a prime. This prime contains the S n _ given by 

Ax = 0, 


coinciding with it if a = 0. By virtue of (3) the point F* 0) does not 
lie in the prime (4). We may therefore choose n independent points 
in this prime, say, 

P® = (4,..., a;*) = 


which, together with P< 0) , form a simplex. We take this simplex as 
a simplex of reference for a new coordinate system, noting, first of 

all, that sWAaP = 0 (*=!,...,»). 


Let the equation of the bilinear correspondence determined by (1) 
in the new coordinate system be 


y*'Bx* = 0 (B = (b {j )). 


Then if x* (i) is the transform of xf®, 


x * WBx *(i) = o (i = l,..., n). 
But = (^o. • • ■ • , #in) (* = 0, . . . , n). 


Hence b oi = 0 (i = 1, 

Since x* (0), Bx* (0) 4= 0, 


it follows that 


^oo 4= 0. 


Hence 


y*'Bx* = b oo!/o z 0 +S S b {j yfxf, 
i= u-i 



and the matrix 
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must be of rank 8. Repeating the process just described for the 
bilinear form n n 

2 Xb ijy *xf, 

we arrive at the following result: 

Theorem IV. If a polarity is of rank s + 1 we can construct a 
simplex of reference, referred to which the polarity has its equation in 
canonical form, as follows: 

(i) there exists a point P< 0) which is not self -conjugate with regard 
to the polarity; 

(ii) there exists a point P® which is not self-conjugate with regard 
to the polarity and is conjugate to P< 0) ; 

• *•••••• 

(i + 1) there exists a point P® which is not self-conjugate with regard 
to the polarity, and is conjugate to P< 0) , ..., P <-1) ; 

••••••• 

(s + 1) there exists a point P® which is not self -conjugate with regard 
to the polarity, and is conjugate to P (0) , ..., P (3 ^ 1) ; 

(s + 2) now let P s+1) , . . . , P re) ben — s independent points of the space 
Sn-8-i whose points have no corresponding prime in the polarity. 
Then we can take P (0) , ..., P (n) as simplex of reference, and, choosing 
the multipliers suitably, the equations of the polarity become 

Ui = Xi (i = 0, «i = 0 (i = « + l, . 

The properties of a polarity are easily read off from the canonical 
form found above. In particular we observe that the properties of 
the polarity we have been considering, 

«< = x i (*«0, 1 

Ui = 0 (i = s+l,...,n),j K ’ 

can be deduced immediately from those of the non-singular polarity 

u { = Xi (i = 0,...,s) (6) 

in the space S s whose equations are 

x 8+1 = ... = x n = 0. 

The space S n _ s _ v whose equations are now 

x 0 = ... = x s = 0, 

is called the vertex of the polarity. No point in this space has a 
corresponding prime in the polarity (5). Since the prime corre- 
sponding to any point not in S n _ g _ 1 passes through S n _ t _ t , it follows 
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that a point of (S n _ 4 _ 1 is conjugate to any point of [»] not in S n _ s _ v 
It also follows immediately, from the equations, that two points P 
and Q of [»] which are not in S n _ g _ 1 are conjugate with regard to (5) 
if and only if their projections from 5 n _ g _ 1 on to 8 a are conjugate 
with regard to (6). 

The geometrical meaning of polarities will become clearer when 

we discuss conjugate points in the chapter on Quadrics in vol. n. 

We need only remark here that points which are conjugate with 

regard to the polarity _ . 

U — AX 

are also conjugate with regard to the quadric 

x'Ax = 0, 

and conversely, the relation between the points being the same in 
each case. 

The non-singular polarity (6) transforms the points of an S k 
whose Grassmann coordinates are (■••,Pi 0 ...{ k , •••) into the primes 
through an »S g _ fc _ 1 whose dual coordinates are where 

This is evident from the equations (6). Conversely, the transform 
of the points of S s _ fc _ x is the set of primes through S k . The spaces 
S k and are said to be conjugate in the polarity. Every point 

in the one space is conjugate to every point in the other space. We 
now prove 

Theorem V. If S k and *S 8 _ fe _ 1 are conjugate in a non-singular 
polarity, a necessary and sufficient condition that they should meet is 
that S k should contain a point P which is conjugate with regard to 
every point of S k . 

Let y®= (yi,...>yi) (i = 0,...,k) 

be Jc + 1 independent points of S k . The equations of <S 4 _ ft _ 1 are given 
by the equations 8 

'£y t jZj = 0 (i = 0,...,k). 

i-o 

If S 8 _ k _i contains a point of S k> say the point 

a —0 

the equations in A 0 , ..., A k , 
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must have a solution. But the condition that there exists a proper 
set A 0 , ...,A fc such that the equations above are satisfied is 
precisely the condition that the point 

o 

is conjugate to each of the points y^°\ ...,y (k) . The point x is then 
conjugate to every point of S k . This proves the theorem, which is 
evident geometrically. 

If we follow the point-prime polarity 

u = Ax 

by the prime-point polarity 

y = Bu, 

the result is the collineation 

y = BAx. 

We conclude our discussion of polarities by proving, in the case 
when the ground field K is algebraically closed, 

Theorem VI. Any collineation in [ri\ can be expressed as the result 
of a point-prime polarity followed by a non-singular prime-point 
polarity. 

We choose our coordinate system so that the collineation is given 
in canonical form 

V = Cx, 

where 

0= /C ei (a x ) 0 . 0 

/ 0 3Ja 2 ) . 0 

V 0 0 C ek (a k ) 

and C e (a) is the e x e matrix 



C t {a) = 


a 1 0 . . 0 

0 a 1 . . 0 

0 . . . a 1 

0 . . . 0 a, 
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Now let 


B = / B, 


0 B, 


0 0 



where B e is the ex e matrix 


Then 


1° 

0 

. 0 

l \ 

1 0 

0 

. 1 

0 

1 0 

1 

. 0 

0 

\l 

0 

. 0 

0 / 

(«l) 


0 

. 


K 


. 


0 

0 


0 • 


But B e C e (a) is the e x e matrix 


/° 

0 

. 0 

a\ 

0 

0 

. a 

: 

1 ° 

a 

. 0 

0 

\a 

1 

. 0 

0 / 
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which is symmetric. Hence A is symmetric. Now B is not only 
symmetric, but also non-singular, and, indeed, 

B = B-K 

The collineation y = Cx is the result of the polarity 

u = Ax 

followed by the non-singular prime-point polarity 

y «= Bu. 
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3. Null-polarities. The properties of null-polarities may be 
discussed along lines similar to those followed in § 2 in the discussion 
of polarities. We first consider the reduction, by means of a suitable 
choice of allowable coordinate system, of the equations of a null- 
polarity to a standard form. We prove the analogue of § 2, Th. I, 
using the same restricted type of transformation. 

Theorem I. By means of a transformation 

x = Px*, y = Py* 

of restricted type , a skew-symmetric bilinear form y' Ax can be reduced 
to the form 

(y?o x * - y* O + (y*. x t - vt *.*)+— + (yt_, *<*„.! -<-!<-»)> 

where (i 0 , ..., i 2r ) are 2 r distinct numbers chosen from the set 
(0 

If n ” y' Ax = a^iy^-y^). 

When a 01 = 0 there is nothing to prove. If a 01 #= 0, we take 



and a 01 (y 0 x x - y x x 0 ) = y% x* -yfx*. 

This proves the theorem when n = 1, and we now proceed by in- 
duction. There are only two cases to consider in this theorem, since 
a nn = 0, the matrix A being skew-symmetric. 

Case (i) If a in = 0 for all i, the theorem follows immediately 
from the hypothesis of induction (cf. § 2, Th. I). 

Case (ii). If not all a in are zero, let k < n be the largest value of i 
such that a in 4= 0. Then 

( k \ / k \ n-l n-1 

2 »nf + 2 ®in2/< Kv+ 2 2 

i-o / \i **0 / i=0i«0 

tc 

ik — 2 a ni X i> 
i =0 

ii = Xi (» = 0, ...,k-l,k+l n). 


Now let 
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This is a restricted transformation. Its inverse is 

k - 1 

X k = ®nk S Q'rti "h %>k> 

0 

= ^ ~ Oj • » • y Jc ?t)j 

which is also of restricted type. Since we can write 

( * \ / k \ n— 1 n-1 

S a ni Xi I - £ ) x n + S' S' 

<“0 / \i =0 / < “0 

n-1 n-1 

+ S' ^kiVk x i~ S' akiVjXk? 

j - o y-o 

we have, after transformation, 


n— 1 n-1 

t/'ria: = Vn£k~VkL+ S' S' 
*=o y«=o 


+ ( S (“Onfc S o «»<% + «dV*) 

= (*.+ s\^k-%(£«+ s J 6|gi)+ : s' 

\ i =0 / \ i =0 / i =0 i=0 


where the 6^, 6^ are easily calculated elements of IT. We now use 
the restricted transformation 

£* = £i (i = 0, — 1), 

S-&+S 

i «0 


and y'Ax^tj*it-7}li*+ S' S 

i =0 /=0 

The rest of the argument follows from the hypothesis of induction, 
and is exactly as in § 2, Th. I. The theorem is then proved. 

Since the rank of PAP is equal to the rank of A, and the rank of 
the bilinear form described in the theorem is 2r, it follows that the 
rank of A is 2r. 

f In the summations S', the value i — Jc is omitted. 

i 
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By a further transformation, not of restricted type, which merely 
rearranges the indeterminates, the bilinear form y' Ax can finally 
be reduced to the form 

2 (^2i^2i+l _ ^w+i-^at)* 

<=o 

When we state this result in terms of null-polarities we have 

Theorem II. A null-polarity has only one projective invariant, 
its rank, which, must be even. If the rank is 2 r the coordinate system can 
be chosen so that the equations of the null-polarity are 


U 2i ~ X 2i+V 
u 2i+l ~ ~ X 2i> 


[i = 0, . ..,r-l), 


Uj — 0 0>2r). 


( 1 ) 


It is easy to show that the reduction to this form is not unique, 
but we do not stop to prove this explicitly. 

An essential difference between the null-polarity and the polarity 
is that every point x is conjugate to itself in a null-polarity. For, if 
A is skew-symmetric, 


x' Ax = ( x' Ax )' = x'A'x — —x' Ax, 


and therefore 


x' Ax = 0. 


In other words the prime u — Ax always contains the point x. 

If two points x, y are conjugate, so that 

y' Ax — —x'Ay — 0, 

then any two points A x+py and px + ay on their join are also 
conjugate. For 

{Xx' +py')A(px+ ay) 

= Xpx'Ax + Xax'Ay + ppy'Ax + pay' Ay — 0. 

Hence every line in [n], joining a pair of points which are conjugate 
in the null-polarity, contains an infinite number of pairs of con- 
jugate points. We also have 

n n 

0 = y'Ax = 2 s a iiVi x i = 2 aniy^j-y^i) 

i<j 

— 2 a ijVij> (2) 

i<j 

where (...,Pn, •••) are the Grassmann coordinates of the line joining 



3. NULL-POLARITIES 


381 


the two conjugate points x, y. Conversely, if x and y are two points 
of a line whose coordinates (...,p i} , ...) satisfy a relation 

£ bijPij = 0, 

i<j 

then we may write this as 

£ £ Kiy^i = y'Bx = o, 

i = 0 i=0 


where b ij = -b ji ( i>j ), 

and b i{ = 0. 


Hence a null-polarity is associated with a system of lines (called a 
linear complex) whose Grassmann coordinates satisfy the relation (2), 
and conversely any linear complex is associated with a unique 
null-polarity. The geometry of a null-polarity is, essentially, the 
geometry of a linear complex and, as such, belongs to Line Geo- 
metry. 

A null-polarity can only be non-singular when n is odd, and then 
r — \(n +1). Let us briefly consider this case. We take the equations 
in canonical form. 


u 2i — X 2t+ 1> 
U 2i+l = ~ X 2i>. 


(i = 0, 


r-1). 


If 


y® = (yi---,yi) (i = o, 


k). 


are k + 1 independent points of [%] which determine an S k , the 
equations of the conjugate common to the transforms of all 

the points of S k are 

r-1 

£ ( y\j+i x 2i~y\j x 2j+i ) = 0 (* = o, ...,&)• 

i - o 

This S n _ k _ ! meets S k if these equations are satisfied by 

* = £A^, 

o 

that is, if the equations 

£ £ {ylj+iy2i~yhyh+i)^h — o •••>&) (3) 

ft =o j =o 

have a proper solution (A 0 , ..., A fc ). The (k+ 1) x (k + 1) matrix 
( a <ft) = | Yiiyij+iV?.] — yhyzj+i)^ 
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is skew-symmetric, and is therefore singular if k + 1 is odd. Thus 
there is always a proper solution of (3), and therefore S k meets 
S n _ k _ ly when h is even. On the other hand, if 1c is odd the matrix 
(a ih ) may be non-singular, and then 8 k does not meet 8 n _ k _ v This 
may be seen by taking 

yW=(£ i0 ,...,<U (i = 0, 

that is, by considering an 8 k determined by the first k + 1 vertices 
of the simplex of reference. In fact the equations of <S n _ fc _ 1 are then 

— - Xq — 0, Xj — — 0, . .., — x^ j — 0, 

and this does not meet the S k we have selected. 

An S k is called self-conjugate if it coincides with its conjugate 
S n _ k _ v This can only occur ifk = n — k — l. We then have k = r — 1. 
If this condition is satisfied there are ^-spaces which are self- 
conjugate. For instance, the k- space determined by the first, third, 
fifth, ..., nth vertices of the simplex of reference is self-conjugate. 
In fact, the equations of the conjugate S k are 

x 0 — s ?2 = ... — x n _i = 0. 

Let us find the condition that the space 8 k whose coordinates are 
•••) is self-conjugate. Since the method does not use the 
canonical form of the null-polarity, we assume that the null-polarity 
is given by the equations 

u = Ax. (4) 

Clearly, a necessary and sufficient condition that S k be self-con- 
jugate is that any two points in S k should be conjugate. Hence, any 
line in S k must belong to the linear complex 

S a i]Pij = 0. 

i<j 

Now any line in S k is the intersection of 8 k and an 8 n _ k+v Let 
(..., ...) be the dual coordinates of an S n _ k+1 . If this 

S n - k +i meets S k in a space of more than one dimension we have 

, S PiHo...ik-^ i °"' ik - i = 0 

for all values of i, j, but if the intersection is a line, its coordinates 
are (...,r y , ...), where [VII, §5] 

ii, 



3. NULL-POLARITIES 383 

It follows that a necessary and sufficient condition that 8 k be 
self-conjugate is that 


i< 1 it ik-t 


for all choices of elements **-« which are coordinates of an 
S n _ k +v But we can always find an S n _ k+1 for which every Grass- 
mann coordinate except an assigned one is zero. We can therefore 
write the necessary and sufficient conditions that S k be self-con- 

jugate as _ A 

2 a ijPiji 0 ...i k - t = °» 

i<j 

for all choices of i 0 , ...,i k _ 2 . 

A point-prime null-polarity 


u 


Ax, 


followed by a prime-point null-polarity 

y = Bu 

leads to the collineation y = BAx. 


If n is even, both A and B are singular, and the collineation is also 
singular. Hence we cannot expect to represent a non-singular 
collineation in this way. We may ask whether the representation is 
possible when n is odd. Similarly, we may enquire whether any 
collineation can be represented as the resultant of a point-prime 
null-polarity (polarity) followed by a prime-point polarity (null- 
polarity). In each case it is seen that certain conditions must be 
satisfied by the elementary divisors of the collineation, and that 
these are necessary and sufficient for the desired representation. 
Since the method of investigating the problems is the same in each 
case, we only consider one of them in detail, merely stating the 
results in the other cases. 

Theorem III. If n is odd and the ground field K is algebraically 
closed, a necessary and sufficient condition that a collineation in [»], 

V = Cx, 

can be represented as the result of a point-prime null-polarity 

u = Ax 


followed by a non-singular prime-point null-polarity 

y = Bu 
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ia that the elementary divisors of C— A I n+1 can be arranged in equal 
pairs : — (A— a x ) e i; (A — a 2 ) e », (A — a 2 ) e »; .... 

We take C in the canonical form 



and find the conditions that there exist skew-symmetric matrices 


A, B such that 


C = BA. 


( 5 ) 


An elementary argument shows that a necessary condition on C is 
that there is another elementary divisor (A — a) f corresponding to 
any elementary divisor (A — a) e of C — A/ n+1 . Let P = B~ l . Then 
the elementary divisors of C — \I n+1 are also the elementary 
divisors of 


PC - \PI n+ i = A- AP, 


which is a skew-symmetric matrix. Hence, if A = a is a latent root 
of C, the rank of A — aP, and hence the rank of C — aI n+1 , is 
» + 1 — 2/9, where p is an integer [Th. II]. There must therefore be 
2 p elementary divisors of the form 


(A — a)*i, . . . , (A — a) e >/>. 


This does not prove, however, that the numbers e v . . . , e 2p can be 
arranged in equal pairs, as is required by our theorem, and a more 
detailed argument is required to prove this. 

We divide A and P = B 1 into blocks of a size suggested by the 
canonical form of C. We write 


and 


/A n . 

A m 

\A kl . 

A kk ) 

= /P 11 

plk' 

\pki 

pkk 


where A ij and P ii are e { x e } matrices. Since A and P are skew- 
symmetric, 


(A*)' = -(A^), and (P«)' = _ (P*<). 


( 6 ) 
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Since PC = A, multiplication of the appropriate matrices shows 

that we must have . , AU 

= A”. 

Using (6), we find that 

{A»Y = (P%(a,))' = C'tfa) (P«Y — — C' e .{ct.j) pi*. 

But (A*)’ = - A* = - P^C^aY) 

for all values of i, j. Hence 

cy«f) P H = We,(«i). (7) 


We use this result to determine the form of the submatrices P ii . 
Write 

F* = /Pll • Pi* 

\Pejl • PtjH 

and define 

2>07k = ° ( h = 1> —><*)» ^7,0 = ° = 1 «/)• 

Equating elements in the Ath row and &th column on both sides of 
(7) we have 


aiPhk+Ph-ik = <* i p hk +p hk - 1 (h= \,...,e } \k= 1 «,). (8) 

Suppose, first of all, that eq + a;. Then, if h = A; = 1 in (8), we get 

(oci-ajpu = 0 , 

and therefore p n = 0. 

Next, if h = 1, k = 2, (8) becomes 

(a i -a ] )p n + p n = 0, 
and so p 12 = 0. 

Proceeding along the first row of P il , then along the second row, 
and so on, we find that 

PI* = 0, 

if oc t + a, } . 

On the other hand, if a f = a } , (8) becomes 

Ph—lk = Phk- 1 0 1 = •••> e i> k = 1, 


HPI 


25 
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If e t < e } , all elements on lines parallel to the line joining p lei to 
are equal, and if e t ^ ^ all elements on lines parallel to the line 
joining to p ejl are equal. Since also, if e { < e jt 

Phk = Ph+ko = 0 (h + k^e } ), 

and if „ „ , , , ^ . 

' Phk — Poh+k — ^ (h + k^. e i ), 

we obtain two forms for the submatrix P H : 

pii = 


1° 


0 0 a b 

\ 0 a b c . I 

a b c . ./ 


: °\ 
. 0 ' 

0 a 

. b 

c 




pH 


10 . 

• 

• • 

• 

0 

a\ 

r 

0 . 

• 

• • 

0 

a 

»\ 

0 . 

• 

. 0 

a 

b 

1 

• • 

. 

• 

0 

a b 

• 

c 

• 

: 


( 9 ) 


( 9 ) 


Thus the matrix P is divided into diagonal blocks of square matrices 

P= /P 1 0 . 

0 P 2 . 


0 0 


corresponding to sets of equal latent roots 

(Oli, •••>*) fcj) •••>&&,) ••• (&kr-i+ 1» • • • > ®fcr)> 

where k r — k. It is easily seen that the matrix A is similarly divided 
into diagonal blocks since 


PC = A 



Writing 
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A «■ /A 1 0 . 0 ’ 

/ 0 A i . 0 

\ 0 0 . A\ 


we may write the relation between the three matrices A, P, C as 

o • o (io) 

I 0 C a, _,.,(<**,-,+«> 0 1 

\ 0 0 <?,„(%,)/ 

In order that P be non-singular, each submatrix P 1 must be non- 
singular. We therefore consider whether there exists a non-singular 
matrix P i made up of matrices of the form (9), such that both P i 
and A 1 are skew-symmetric. The latent roots which occur in the 
set (oc ki i+1 , . . . , a^) are all equal, and our concern is with the suffixes 
( e fci_,+ 1 , •••>%)■ To simplify the notation we omit the indices and 
sub-suffixes in (10), and write the relation as 


P 


where a L = a 2 
We have 


,(*l) 

0 

0 

0 C e ,(a 2 ) . 

0 

• 

• 

• 

0 

0 

C ek (a k ) 

= a k , and we 

assume 

P= | 

r : 

l^p* 1 . 

P \ 

pkkl 


= A, 




where the elements P ii are x e t matrices given by (9). Since P is 
to be skew, it follows from (9) that 

P ii = 0 (i = 1, ...,&). 

Again, as in (6), P £j + (P H )' = 0. 

If e i = e } , we see from (9) that 

pn = (P«)', 

P*1 = — PH. 


and therefore 


( 11 ) 
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We now form a new matrix 

Q = (q tt ) 

from P , by picking out the element at the bottom left-hand 
corner of P ij . Then Q is a k x k matrix, and from (9) we see that 

<la = 0 (if e, < e,), 

but, if e, = e p = 

from (11). Now, if 

®i = . . . = e ai < e ai+1 — ... = e ai+at < . . . — ®aj+...+a» = 

so that the indices e lf ...,e k , which are assumed to be arranged in 
ascending order of magnitude, fall into s sets of equal indices, 


Q = /Q n o 
/ • Q 22 



> 


where Q u is an a t x matrix, and all the submatrices above the 
principal diagonal are zero matrices. Since 

det Q = n det Q u , 
i 

Q is singular unless all the matrices Q u are non-singular. Since Q u 
is a skew-symmetric matrix, it is only non-singular when a { is even. 
We now show that Q cannot be singular unless P is singular. For if 
Q is singular, its columns are linearly dependent. But if we replace 
the elements of Q in their original positions in P, we see that the 
remaining elements of the column of P which contains q {} are all 
zero. Hence if the columns of Q are linearly dependent the columns 
of P which contain elements q if are also linearly dependent. There- 
fore if Q is singular, P is singular. Since we assumed P to be non- 
singular it follows that the cq are even, and we have proved the 
necessity of the conditions of the theorem, that the elementary 
divisors of C— A/ n+1 can be arranged in equal pairs: 

(A - a^i, (A— a x ) e i; (A-a a ) e », (A-a 4 )«»; .... 
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The suffioienoy of the conditions is proved more easily. Assuming 
that k = 2 h, and that 

= e 2 , e 8 = e 4 , e. 2 h-i — e aft> 
we write, in equation (10), 



where E i is the ixi matrix 

E t = ( 0 0 . 0 1\ 

° : : ! ° • 

\i o . . 0/ 

It is clear that P l is non-singular and skew-symmetric, and by 
elementary calculation we see that the matrix A i defined by (10) 
is also skew-symmetric. We have therefore constructed the matrices 
A and B — P~ x required, and have proved our theorem. 

Other results, involving a combination of a null-polarity and a 
polarity, are most easily stated in terms of the properties of matrices. 

Theorem IV. Given a matrix C, necessary and sufficient con- 
ditions that there exist matrices A, B ( where A is skew-symmetric, and 
B is non-singular and symmetric), such that 

G = BA 

are that the elementary divisors of C — A/ n+1 corresponding to non-zero 
latent roots can be arranged in pairs (A — ot) e , (A + aCf, and the elemen- 
tary divisors X f corresponding to a zero latent root are even in number 
for any given even value off. 

Theorem V. Given a matrix C, necessary and sufficient conditions 
that there exist matrices A, B (where A is symmetric and B is non- 
singular and skew-symmetric), such that 

C — BA 
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are that the elementary divisors of C— XI n+1 corresponding to a non- 
zero latent root of G can be arranged in pairs (A — a)®, (A + a)«, and the 
elementary divisors X f corresponding to a zero latent root are even in 
number for any given odd value off. 


4. Simple correlations. We now consider correlations in [n] 
which are neither polarities nor null-polarities. We assume that the 
ground field K is algebraically closed, and investigate the conditions 
in which the correlation u ^ 


can be transformed to 


VWU LTV VX CllllOl V/l U1VU, W 

v = By 

by a non-singular transformation 

x = Py, or equivalently u = Pv, 

of point or prime-coordinates respectively. We may regard the 
transformation as a collineation in [»] instead of a transformation 
of coordinates, if we wish to do so. 

Algebraically, this is the problem of determining when a non- 
singular matrix P can be found such that 

B = P'AP. (2) 

If (2) holds, P' A' P 

and therefore, if A is an indeterminate, 

B — XB' = P'(A — XA')P, 

so that the matrices A — XA' and B — XB' have the same rank and 
elementary divisors. This gives a necessary condition for the 
existence of a matrix P satisfying (2). We now show that it is also 
a sufficient condition when the matrices A —XA' and B—XB' are 
non-singular. 

We can choose an element d (not + 1) in K such that 


C = A-dA' and D = B-dB' 


are non-singular. If A —XA' and B — XB' (which are both of rank 
n+ 1) have the same elementary divisors, so have 

G—pG' and D—pD', 

p being an indeterminate over K related to A by the equation 

(1+dp) X = d+p. 
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If there exists a non-singular matrix P such that 

D = P'CP, 

then B - (1 - d 2 )-* [D + dD '] 

= {\-d*)-'P'[C + dC']P 
- P'AP. 

Now let us suppose that C—fiC' and D—/iD' have the same 
elementary divisors. Since C' and D' are non-singular, there exist 
[II, § 9, Th. V] non-singular matrices R and S such that 

D — RCS, and D'^RC'8. 

From the second equation we have 

D = S' OR', 
and therefore RCS = S'CR'; 

that is, XC = CX', 

where X = SR. 

From XC = CX ’ 

we derive X*C = XCX' = C(X ') 2 , 

and X*C = XC(X'f = C{X')\ 

and finally X n C = C(X') n . 

Hence, if /(A) is any polynomial in ^[A], 

f(X) C = Cf(X’). 

But [II, § 10, Th. II] there exists a polynomial /(A) such that if 
Y =f(X), and therefore YC = CY\ 
then F 2 = X. 

Now, D = S'CR ' 

= S’CX'S 

- S'C(Y')*S 

- S'CY'Y'S 

= (S'Y) C(Y'S). 

Defining P = Y'S, 

we have proved that D — P'CP. 
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Henoe we have 

Theorem I. If A and B are two (n+ 1) x (n+ 1) matrices which 
are such that A — XA’ and B — XB' are non-singular , X being an in- 
determinate, then a necessary and sufficient condition that there should 
exist a non-singular matrix P such that 

B = P'AP 

is that A — XA' and B — XB' have the same elementary divisors. 

In the case in which A (and therefore B) is non-singular, we 
deduce as an immediate corollary 

Theorem II. A necessary and sufficient condition that the two 
non-singular correlations 

u = Ax, u = Bx 

should be equivalent is that the associated collineations 

y — AAx, y = BBx 
should be similar collineations. 

The next question which suggests itself is: can we find an 
(«+l)x(fH-l) matrix A such that A — XA ' has assigned elementary 
divisors? It is easily seen that this is not always possible. Let the 
elementary divisors of A — XA' be 

(A — a x ) e i, ...,{X-ct k ) e k. 

If a £ 4= 0, and (A — a*)®* is a factor of each determinant of A — XA' 
of order j, then (A — a^ x ) e < is a factor of each determinant of A' — XA 
of order j, and the converse of this statement is also true. Now, the 
transpose of A — XA' is A' — XA, and the transpose of a A-matrix 
has the same elementary divisors as the original matrix. Hence it 
follows that if (A — a*)®* is an elementary divisor of A — XA' (a £ #= 0), 
so is (X — aj 1 ) 6 *. 

Again, if a x = a 2 = ... = a r = 0, so that A*i, ..., A*>- are the 
elementary divisors of A — XA' corresponding to a zero latent 
root, we deduce from the fact that 

det {A - AA') = det (A' -XA) 

that the characteristic polynomial of A —XA' is of degree 

r 

u + 1 — 2 

It therefor© follows that 1 

k r 

Se* = n+ 1-Sc<. 

1 1 
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We have thus found some necessary conditions to be satisfied by 
the elementary divisors of A—XA'. If 4= ± 1, the elementary 
divisors (other than A e <) can be arranged in pairs, a typical pair 
being (A— (A — but when = ± 1 the elementary divisor 
(A — a t ) e i may, possibly, be paired with itself. We now prove 

Theorem III. Sufficient conditions that there exists a matrix A 
such that A — A A' has assigned elementary divisors are 

(i) the elementary divisors other than those of type A e can be arranged 

in pairs — (A — a^ x )A (a { 4= ± 1); 

(ii) if the elementary divisors are 

AV-mA^, (A-aj)A, (A— af 1 )A, ..., (A — a g )A, (A-a^jA, 
then 2'£ l e i + 2'£f i = n+l. 

l l 


Let D f (a) be the 2 f x 2 f matrix 


DM) = 


r 

• 

• 

0 

0 

• 

• 

• 

a\ 

■ 

• 

• 

. 

• 

• 

• 

a 

i\ 

• 

• 

• 

• 

• 

• 


i 

0 \ 

• 

0 

• 

• 

0 

a 

1 


• 

0 1 

0 

• 

• 

1 

0 

. 


• 

0 1 

• 

• 

I 

0 

0 

• 



0 / 

V 

• 

0 


• 

• 

0 

• 

• 

• 

• 

• 

• 

0/ 


Then D f (a) — 
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Except for numerical factors, the determinant of this matrix is 

(A— a)^(Aa— 1)^, 

■whilst the determinant of the (2/— 1) x (2/— 1) submatrix obtained 
by crossing out the first column and (/+ l)th row is 

(A -a)' A'- 1 , 

and the determinant of the (2/— 1) x (2/— 1) submatrix obtained 
by crossing out the first row and the (/+ l)th column is 

(Aa— 1)A 

Hence the highest common factor of all determinants of submatrices 
of order 2/— 1 is 1, since we have assumed that a= t= ± 1, and when 
a =(= 0 the elementary divisors of D f (a) — A D' f (a) are therefore 

(A — a) f and (A — a -1 )^ 

If a = 0 the above argument shows that the only elementary 
divisor is AA It follows that 



is a matrix such that A—AA' has the assigned elementary divisors. 
Whenever the conditions of Theorem III hold we shall take 

u = Ax 

(where A is given by (3) above) as the canonical form of a correla- 
tion having the assigned elementary divisors. For A—AA’ is 
non-singular, and we can apply Theorem I. 

The conditions of Theorem III have also been shown to be 
necessary in the case when A — AA' is non-singular and has no 
elementary divisors of the form (A + l) e . In this case 

A + A' and A— A' 

are both non-singular. We call this the simple case, and we have 
Theorem IV. In the case of a simple correlation the conditions 
of the previous theorem are both necessary and sufficient. 

f If a = ± 1, we find that the elementary divisors are (A± 1) / , (A± l) r , 
but we must use a different argument. 
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We observe that if n is even, A— A', being skew, is necessarily 
singular, and the simple case does not arise. We could extend the 
notion of a simple correlation to include the case in which A—AA' 
has elementary divisors (A + 1 ) e , provided that these can be arranged 
in pairs, but it turns out to be more convenient to proceed with the 
transformation of the most general type of correlation in [»] to 
canonical form. 

5. Transformation of the general correlation. In this section 
we show how to obtain a canonical form for any correlation in [n] 
which does not fall into the categories already considered. Such a 
correlation, which is not necessarily a polarity, a null-polarity, or 
a simple correlation will be called a general correlation. We treat 
the problem of transforming (or reducing) 

u = Ax 

to a canonical form as a purely algebraic one, working with the 
bilinear form y' Ax associated with the correlation. 

Let B and C be, respectively, the symmetric and skew-symmetric 
parts of A : that is, let 

B = UA+A'), C = \(A-A'). 

When A is transformed to P'AP, B is transformed to 
\{P'AP+P'A'P) = P'BP, 
and C is transformed to 

\(P'AP-P'A'P) = P'CP. 

Hence our problem may be regarded as that of the determination of 
canonical forms for the simultaneous reduction of the symmetric 
and skew-symmetric forms 

y'Bx, y'Cx. 

If B and C are both non-singular, we are in the simple case 
considered in § 4, and we may use the canonical form given there for 
A — B + C. When, in the notation of § 4, 

A = D e (a), 

y'Bx = £( oc+l)[{y 0 x 2e _ 1 + y 2e _ 1 x 0 ) + ... + (y e . 1 x e +y e x e _ 1 )] 

+ WUlXte-l + i*x) + • •• + (y e - 1 * 6+1 + ye+l*e-l)l 

and 

y'Cx = i(a- 1) [(y 0 *2e-l-2/2e-lZo) + — + (y«-l*«-y e *e-l)] 

+ i[(yi**-i-y*-i*i) + — + (y«— i* e +i -&+!*«— i)]- 
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We shall denote these expressions by 

J 3 (a f e) and y(a, e) 

respectively. When A is in the canonical form 


/D ei (a 1) 

0 

0 \ 

° 

£*(<*2) . 

? 

\ 0 

• • 
0 

D ek (ct k )/ 


the form y'Bx is equal to a sum of forms such as ft(a, e), differing 
only in the a, e and in the indeterminates (y 0 , . . . , (*o> • • • > x te- 1) 

involved. Similarly, y'Gx is equal to a sum of forms such as y(a, e). 
We shall write 

y'Bx = 2 /?(<%*, ef), y'Gx = 2 7 ( a t> e <) 

i i 

to express these facts. When we use this notation it is to be under- 
stood that each /}( a iy e { ) is paired with a y(a t -, e f ) having the same 
a i ,e i , and involving the same indeterminates, in the same order, 
and that two terms e { ) and ft(ctp ef) have no indeterminates 
in common. Similarly, y(a f , e i ) and y{a.p efj have no indeterminates 
in common. The results of § 4 can now be stated as 

Theorem I. If B and C are each non-singular , the symmetric and 
skew-symmetric forms y’Bx and y'Gx can be simultaneously reduced 
to the canonical forms 2 A( a i> e i) an & £ 7( a i> e i)- 

i i 

We must now extend this theorem to the case in which B or G 
is singular. We introduce five different pairs of forms 

(i) &(e) = (y 0 * 2e _ 1 + y Ze -i x o) + ( 2 /i *2e-2 + 2/26-2 *1) +••• 

+ (y e -l*e + ye x e-l)> 

7 x( e ) = (2/1 ^2«-l - 2/2e-l *1 ) + (2/2 ^2e-2 - 2/2e-2 *2) + — 

+ (y e -l x e+l ~ y e +l x e-l)> 
where y x (l) = 0 ; 

(ii) fi t (e) = (2/1*26-1 + y%e-x x i) + (2/2^2e-a + 2/26-2*2) +••• 

+ ( 2 /«-l*e+l + 2 /e+l* e -l) ! 

72(c) = (2/o*2e-l-2/26-l*o) + (2/l*2 e -2-2/26-2*l) + - 

+ ( 2 / e -l*e- 2 / e *e-l). 


where 


/?*(!) = 0 ; 
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(iii) fiz( e ) = (2/o^-a + 2 / 2 e- 2 *o) + (Vi^ze-z + Vze-z^i) + — 

+ {y e -z*e + y^e-i) + Ve-lXe-V 

73(e) = (2/l *2e-2 - 2/2e-2 ^l) + (2/2 ^26-3 “ 2/2e-3 *a) + • • • 

+ (Ve- lXe~y e Xe-l), 

where = y 0 x 0 , and y 3 (l) = 0; 

(iv) &(«) = (2/l*2«-l + 2/26-1 *l) + (2/ 2 ^2e-2 + 2/2e-2*2) + — 

+ ( 2 /e-l*e+l + Ve+ l*e-l) + 2 /e* e , 

74(e) = (2/o*2e-l - 2/2«-1 »o) + (2/l*2e-2 - </2e-2*l) + — 

+ (2/e-l* e -2/e«e-l)» 

where /? 4 (1) = 2 / 1 * 1 ; 

(v) As(e) = {y 0 x 2e + y 2R x 0 ) + {y 1 x 2e _ 1 + y 2e _ 1 x 1 ) + ... 

+ (2/e-i*e + i + y e+ i*«-i), 

7e(e) = (2/l*2 6 -^2e*l) + (2/2*2e-l-2/2 e -l*2) + — 

+ (2/e*e+l-2/e+lZ e )- 

Our object is to obtain the following generalisation of Theorem I: 

Theorem II. If B, G are, respectively, symmetric and skew- 
symmetric (n+l)x(n + l) matrices, the bilinear forms y'Bx, y'Gx 
can be simultaneously transformed to 

2 «i) + 2 A(e,) + 2 PM + 2 A(e,) + 2 A(e J + 2 &(«*). 

1 j k l m h 

and 

2 7(<*i, e i) + 2 7i( e /) + 2 7a(e*) + 2 7s( e i) + 2 7i(«m) + 2 7 s(e A ) 

i j k l m h 

respectively, with the conventions previously introduced for the pairing 
of the terms and for the indeterminates involved in the two expressions. 
We consider the equations 

Bx = 0, (1) 

Cx — 0. (2) 

Let these define, respectively, linear spaces S ri _ lt S rt _ x of dimensions 
— 1, r 2 — 1, and let these spaces intersect in a space of r 0 — 1 
dimensions. We transform our coordinates, choosing a new simplex 
of reference in the following way. In S f0 -i take r 0 independent 
points A n _ rt+1 , ...,A n . In S fi _ x there exist r x — r 0 further independent 
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points A n _ ri+1 , and in S,.^ there are r 2 —r 0 independent 

points which are independent of A n _ ra+ ly ...,A n> say 

A A 

^n-fi-r a +r 0 +l> • • • > ** n-r x • 

The points -4 n _ ri _ rj+fo+1 , ...,A n are linearly independent, and define 
the join of the spaces S ri _ lt S ri _ v We complete the new simplex of 
reference by choosing n — r 1 —r 2 +r 0 + 1 points in [«.] which are 
independent of those already chosen, denoting them by 

A A 

•••> ^n-rx-rj-HV 

In the new coordinate system S r x has the equations 


•Tn — • • • — X, 


n-r 0 


/S fi _ 1 has the equations 


x n 


... = X, 


n-r t 


= o, 


= o, 


and 8 r t has the equations 


*0 ••• x n—ri— ri+r 0 x n— rj+1 ... — X n - Ta 9* 


(3) 


In this coordinate system y'Bx is a form in the indeterminates 
(x 0 ,...,x n _ ri ), (y 0 , ...,y n - ri ) only, and y'Gx is a form in the indeter- 
minates 


(x 0 , . . . , x n—rir . r ,+ r# > x n _ r 1+ i, ...,a; n _ ro ), 

(2/o» • • • > yn-n-r I+ r 0 . Un— rj+l. • • • > 2/n— r 0 ) 


only. In proving our theorem we shall use a series of transformations 
of coordinates, but we shall arrange matters so that at every stage 
the equations of S ro _ v S ri _ x and 8 rt _ t are given in the new coordinate 
system by (3). The reader will readily verify that any transformation 
of coordinates which ensures this result is of the form 


i = Px, y = Py, 

where 

P = [D 0 0 0\ 

IE F 0 0 \ 

I G 0 H or 

\L M N K/ 


the matrices in the first, second, third and fourth rows (columns) 
having n — r 1 — r t + r 0 + 1, r 2 — r 0 , r 1 — r 0 and r 0 rows (columns) 
respectively. 
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Our proof of Theorem II will be by induction on the number of 
indeterminates x it y } involved, and for the purpose of the induction 
it is found convenient to impose a further restriction on P by 
requiring that the elements of F and H above the principal diagonal 
are zero, and that K = 7 r# . The transformation is then called a 
permissible transformation. 

We shall prove that y'Bx, y'Cx can be reduced to the forms given 
in the enunciation of Theorem II by a permissible transformation, 
after the preliminary transformation has been made. 

We commence our induction proof by considering the case n = 1 . 

Lbt y'Bx = ay 0 x 0 + b{y 0 x x + y x x 0 ) + ey x x x , 

y'Cx = %<>*!-- */!*<>), 

before any preliminary transformation is carried out. We have 
three cases to consider: 

(i) If B and C are non-singular, r 0 = r x = r 2 = 0 . No preliminary 
transformation is required, and any non-singular transformation is 
a permissible one. The result follows from the consideration of the 
simple case treated in § 4. 

(ii) If B = 0, (7 + 0, r x = 2, r 0 = r 2 = 0, and y'Cx is not essentially 
altered by the preliminary transformation. The permissible trans- 
formation 


tr.) 


transforms y'Cx to y 2 (l), an d y'Bx to /? 2 (1). 

If C — 0, B - {= 0, r 2 = 2, r 0 = r x = 1 or r 0 = = 0. If r x = 1, a 

preliminary transformation reduces y'Bx to the form ay 0 x 0 , and 
the permissible transformation 

£ = /«* 0 \ x 


(o n 


completes the reduction to /? 3 (1). We take y'Cx in its transformed 
form as y 3 (l). If r x = 0 we need no preliminary transformation, 
and the restricted transformation of § 2, Th. I, which is permissible 
in this case, transforms y'Bx to the form x a y 1 + x t y 0 (which is /? x (l), 
and can be paired with y x (l)), or to the form ay 0 x 0 + cy 1 x v This last 
form can be reduced by a permissible transformation to the form 
yo x o + Vi x v Thus we have the reduction y'Bx = /? 3 (1)+/? 3 (1), 
y'Cx = y 3 (l) + y 3 (l). 
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(iii) The only remaining case is that in which G is not singular, 
and therefore k is not zero, and B is singular, but not zero, so that 
ac = b 2 . Then r 0 = r % — 0, r x = 1, and the preliminary transforma- 
tion reduces the forms y'Bz, y'Cx to forms of the type 

dy Q x 0 p(y 0 x 1 ~yix 0 ), 

where d and p are not zero. The transformation 

i = /— d* 0 \ x 

\ 0 pd-i) 

is permissible, and reduces the two bilinear forms to 

£o^o> Vi£o~yo£i> 

which are /? 4 (1), y 4 (l) respectively. 

We may therefore assume as hypothesis of induction that 
Theorem II holds for a pair of forms y'Bx, y'Cx which can be 
expressed in terms of the indeterminates (y 0 , ...,y m ), (x 0 ,...,x m ), 
where m<n. 

Now, our preliminary transformation expresses y'Bx , y'Cx in 
terms of the indeterminates (y 0 , ...,y n -r 0 )> ( x o> ••■> a V-r„)> and not 
in terms of fewer indeterminates. By the hypothesis of induction 
we can reduce these forms to canonical forms, as required by 
Theorem II, by a permissible transformation if (i) r 0 > 0, and (ii) the 
indeterminates {y 0 , ...,y n _ ro )> ( x o> ••■> x n-r 0 ) are subjected to a 
preliminary transformation. But no preliminary transformation 
is necessary since, in the new coordinate system r' 0 = 0 ,r' 1 = r 1 — r 0 , 
r % ~ r % ~ r o> an( l, if n’ — n — r Q> the equations of ^ r - o _ 1 are 


x o = ••• = x n' = 0 , 

those of S r . j are _ _ 

•^0 — ••• r'x 

and those of are 

x 0 ~ ••• ~ X n'-r’ 1 -r‘ t = x n’-r\+\ = • • • — x n - = 0 , 

in accordance with (3). Hence the permissible transformation 



transforms the bilinear forms in the indeterminates (y Q , ...,y n _ f# ), 
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(x 0 , ...,®n-r 0 ) * nto required forms. Since we obtain from this 
the permissible transformation 

£ = ID 0 0 0\<r 

UFO 0 \ 

I Q 0 H 0 I 

\o 0 0 ij 

on the indeterminates {y 0 , ...,y n ), (x 0 ,...,x n ), which also reduces 
y'Bx, y'Gx to canonical form, our theorem is proved if r 0 >0. We 
may therefore now confine ourselves to the case r 0 = 0. 

Now suppose that r x > 0. A preliminary transformation gives us 

n—fi 

y'Bx = £ bitfiXp 
0 

n 

y'Cx = 'Z'c ij y i Xp 
0 
It 

where £' denotes summation over the values 
o 

0 ,...,n-r x -r^, n-r x +\,...,n (4) 

of the suffixes i,j. In order to avoid the inconvenience of having to 
introduce new symbols for the indeterminates with each successive 
transformation, we shall start each step of the argument afresh 
with the indeterminates ( x 0 ,...,x n ), (y 0 ,...,y n ), except in cases 
where this would cause misunderstanding. 

Using the transformation of §3 applied to y'Cx, we have to 
consider only Case (ii), in which not all c in = 0, since, if c in = 0 for 
all values of i, r 0 > 0, contrary to hypothesis. The restricted trans- 
formations of that section are permissible, because of the absence 
of terms c {] in y' Cx in which either i or j lies between n — r 1 —r i + 1 
and n — r v both values inclusive. Hence, using the same symbols 
as above, a permissible transformation reduces our bilinear forms to 

n—fi 

y'Bx = £ b^y^p 
0 

n— 1 

y’Cx = S' CijVi^j + iyn^k-yA), 

0 

* n— 1 

where k is one of the suffixes given by (4), and £' omits the suffixes 

o 

n and k, besides those already mentioned. We now have two cases: 


H P l 


26 
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Case (i). k>n—r v The forms 

n— r x n—1 

£ biiViXf, £' CijViZj 
0 0 

are forms in the indeterminates 

(x 0 , . . X k _^, x k+l , (?/o> ■ ■ ■ ) Vk- 1 ’ Vk+ 1> • ••> Vn—l) 

for which ro = 0, ri = r, — 2, r 2 = r 2 , n' — n— 2, 

and we see that these forms require no further preliminary trans- 
formation. By the hypothesis of induction, they can be reduced to 
canonical forms by a permissible transformation, and it is easily 
seen that a permissible transformation on the indeterminates 
(x 0 , ...,x k _ v x k+1 , ...,a; n _ 1 ) is still permissible when we add to the 
equations of transformation the further equations 

= ® k> %>n ~ 

in their proper place. Since 

#a(l) = 0, 7a( 1 ) 

the theorem is proved in the case k>n — r 1 . 

Case (ii). k^n — r v Since c kn y k x n was a term in the prepared 
form y'Cx, we must have k^n — r x — r 2 . Since a transformation of 
coordinates which permutes x 0 , ...,x n _ Ti _ Tt , but does not affect the 
remaining x t , is permissible we may, after a suitable permissible 
transformation, take k — 0. We now write 

n-r x 

y'Bx = y 0 £ + yx 0 + £ b aVi x i> 

n—1 

y'Cx = y n x 0 -y 0 x n + S' CijViXp 

n—1 

where the summation S' is over the values 

i 

1, ...,n-r 1 -r i) n-r 1 + 1, 1 
of the suffixes i, j, and 

n-n n-r, 

£ = £ b { x it S hyp 

o o 

where 2 b 0 = b 00 , and = b { . 
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Now, by hypothesis, the matrix 


/ ^0 

°i 

°2 • 

, V 

bl 

b ii 

b 12 

cr- 

M 

3 

1 

\. • 

• 

• i 

■ * / 

Wn— r x 

1 

• 

* On-tin— f\f 


( 6 ) 


is of rank n — r l + 1. We first prove that this implies 
submatrix 


'ii 


I • 

\b n — r ,l • t>n— r-J 


that the 

( 7 ) 


is of rank n — r x or n — r x — 1 . For, suppose that the rank of (7) were 
less than n — r x — 1. Then there would exist at least two distinct 
relations 


2 \b it = 0 (j = 1 , ...,n-r x ), 

i = 1 


n— r x 

(j = l,...,»-ri), 

i = 1 

connecting the rows of (7). If p, cr are chosen so that 


n-n n-n 

p 2 \b i + (T 2 Pi b i = o, 

i i 

then the multiples 

0, p\ i + <r/i v . . . , p\ n _ ri + <rp n _ Tx 

of the first, second, ..., (n — rq+ljth rows respectively of (6) are 
linearly dependent, and (6) is singular, contrary to hypothesis. 

Case (iia). Suppose that the matrix (7) is of rank n — r l , and 
consider the forms 

n— r x n — 1 

2 b^xj, S' tijViXj' 

i i 


For these forms in the indeterminates (x x , ...,x n _ 1 ), (y v ...,2/ n -i) we 

have , o / i 

n = n — 2, r[ = r x — 1, 


K = 


Hence no preliminary transformation is necessary, and by hypo- 
thesis there is a permissible transformation which reduces these to 
canonical form. The equations of transformation, together with the 

equations , _ r _ 

&0 — x 0> 5n — *n 


26-2 
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are seen to define a permissible transformation of the indeter- 
minates (x 0 ,...,x n ). Before making this transformation, however, 
we make the permissible transformation 

x 0 = z 0 , 

x i = x i -c i x o (i = 1, ...,n-r x ), 
x i = x i (i = n — r 1 + 1, ...,n) 

on the original forms (5), our object being to simplify the term 
y 0 £ + V x o before carrying out the final permissible transformation 
described above. We find that 


n— r. 


+ *0 

n—r t 


y'Bx = y 0 (b Q x 0 + 2 Mi - x 0 2 Mi) 

1 1 

( _ n-r, _ _ n-r x \ 

Mo + 2 Mi - y 0 2 Mi) 

l ~ Vl _ _ _ 

+ 2 b^yi-Ciy^ixj-CjXo) 

n-r x __ __ 

= 2b 0 x 0 y 0 + y 0 l + yx 0 + 2 b ;j y i x j 

( n ~ r i _ \ /n-r, __ \ 

2 b u c { xA -IS b ij y i c j I x 0 


( n-r, n-r, \ 

2 b ij c i c j -2 2 Mij 


*/<>*<» 


and 


_ n-r x -r t _ _ _ _ 

y'C'* = Vn x 0 - 00 +- 2 Cii(«/i - C, y 0 ) (Xj - C } x o) 

n— 1 

+ 2 

n-r,+ l 

= Vn x O-yo x n 

n— 1 n—r x —r 2 n—r x — r t 

+ 2' - y 0 2 -x 0 2 cn^Vi, 

_ n_r ‘ -r * 

since x 0 y 0 2 c ij c t c j — 0. In the first expression we have written 

n— r, n— r, 

1=2 Mi> M2 

i i 
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Now, since the matrix (7) is non-singular, we can choose the 
constants c v ...,c n _ ri so that 

n— r i 

£ bijC } = b { ( i = l.—.n-fj). 
i - 1 

This simplifies the expression for y'Bx to 
2b 0 x 0 y 0 + S byyiXj-l S 

n—ri 

= py 0 x o+ £ bijVi^ 

say. Now make the further permissible transformation 

*?=■«< (i = 0, 1), 


We then have 


_ n-r,-r. 


y'Bx = py* x* + S b i} yfxf, 
y'Cx = ytx*-y*xt+ \' c u yfxj. 


Now p cannot be zero, otherwise B would be of rank n — r v instead 
of n — r, + 1. Hence, by another permissible transformation which 
consists of replacing p^x* by x$, we may take p = 1. Now apply 
the hypothesis of induction to the forms 

£* hy^xf, S' Cyyfaif 

and, as explained above, we deduce a permissible transformation of 
the indeterminates (x*, x*), (y*, ...,?/*) which gives us 

y'Bx = y$x$ + Z /?(a f , e { ) + 

y'Cx = y*xJ-y3 1 x* + ry(a i ,e i )+2Ty i (e i ). 

Since these forms may finally be written 

y'Bx = fi t (l ) + Z/J(ct ii e t ) + Zi:fi i (e i ), 
y'Cx = y 4 (l ) + Zy(a i ,e i ) + 2Zy i (e i ), 

this completes the proof of Case (iia). 
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Case (ii&). Suppose that the matrix (7) is of rank n — r x — 1. Then 

the equations n _ n 

2 = 0 (1=1,...,%-^) 

i -l 


have just one solution. Apply the methods of § 2 to reduce 

n— r t 

2 to canonical form. The restricted transformation of that 

section can evidently be amplified to a permissible transformation 
of the indeterminates (x 0 ,...,x n ), (y 0> ...,y n ). Now consider the 
canonical (n— r x ) x {n— r x ) matrix corresponding to the reduced 


n— r i 


form 2 yi x f . Since this matrix is of rank n — r x — 1 , it is clear that 

one row consists entirely of zeros. That is, preserving the same 
notation, , 


J ih 


0 = 


for some value of h (1 ^h^n — r^), so that the one solution of the 


equations above is now 




We have two cases to consider :h^ ; n — r 1 — r 2 ,h>n — r 1 — r 2 . 

Case (ii6 x ). h>n — r 1 — r 2 . We have, with the previous notation, 


n—r x 

y'Bx = y 0 £+yx 0 + 2* b^x,, 

n— 1 

y'Cx = y n x 0 -y 0 x n + 2'%2/i^, 

where the summation omits the value h. We now proceed as in 
Case (iia), making the substitution 


x 0 = x 0 , 

x t = Xi-cfio ( i = 1, ...,h-l,h+l,...,n-r 1 ), 
%h = x h > 

x t — x i (i = + 

This is a permissible transformation, and gives us 

y'Bx = 2 0 £o[- 2 2* biC { + 2* 2* b^y^ 

n—r x n—r x 

+b h (y 0 x h +y h x 0 ) + y 0 2* Mi + *o 2* b t y { 

0 0 


n—r x n—r x 

-Vo E* biters o 2* b^Cjyt. 



5. TRANSFORMATION OF THE GENERAL CORRELATION 407 
We can choose c, , . . . , c A-1 , c h+1 , . . . , c n _ ri so that 

n—ri 

= b it 

the matrix of coefficients having rank n—r x — 1, and then 

»— r, 

y'Bx = y 0 (^o+2«ft)+(m+7^)»o+S*6«yi*/- 

Since £ is of rank n — r x +l, g cannot be zero. At this stage 
we have 

y'Cx = y n x 0 -y 0 x n 

n— 1 n-ri-r 8 n— n-r, 

+ S' - y 0 S - ®o S c « c ^i> 

since n — r x — r a < & < n — r v The permissible transformation 

xf — x { (i = 0, 1), 

_ n-n-r* 

4 = *n + S «#«<*/ 

1 

leaves y'Bx unaltered, and transforms y'Cx into 

y'Cx = y$x*-y*xt + 'Z' c^yfxf. 

As a final transformation on 

y'Bx = y*(px* + qx*) + (py$ + qyt) x* + £* b^tfixf, 

we write 

£i = *? (» = 0, .. .,h-l,h+l, 

£h — p x * + q x t- 

This is a permissible transformation, and gives us 


n—r x 

y'Bx = y 0 Zh + Vh£o+ 2* 

71—1 

y'Cx = y n £o-Vo£ n + S' CiiViZj- 
We now make use of the hypothesis of induction to reduce 

S and 2' c uVd} 
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to canonical forms by a permissible transformation on the indeter- 
minates (g lt ...,£v-i,£»+i* —,Vh-i>Vh+v noting 

that no preliminary transformation is necessary. For this set of 
indeterminates, 

n' = n- 3, r[ = r x — 1, r' 2 = r 2 -l, 

and the transformation induces a permissible one of (£ 0 , 

(Vo> •••>'»/«)• We have, finally, 

y’Bx = y h x 0 + y 0 x h + E 0(a i} e { ) + EE &(e,), 
y’Cx = y n x 0 - y 0 x n + E y(a it e { ) + EE y<(e,), 

that is, 

y’Bx = y? 6 (l )+Ep(a i ,e i ) + EEp i (e j ), 
y’Cx = y 5 (l) + E y(a t> e { ) + EEyfa). 

Case (ii & 2 ). h^n — r x — r 2 . By means of a permissible trans- 
formation, which interchanges the suffixes 1 and h, we have, 
preserving the notation, 

n-r, 

y’Bx = y 0 £+yx l „+ S h nVi x p 
2 

n- 1 

y'Gx = y n x 0 -y 0 x n + S' CijViXj- 

In considering Cases (iia) and (iii,) we left the forms 
^b ij y i x j and Ec^Xj 

unaltered until the end, and then reduced them to canonical form. 
In this case it is necessary to reduce the forms 

n — ri n— 1 

S b ijVi x i an( i S' c ijVi x i 
2 1 

to canonical form at this stage. In applying the hypothesis of 
induction to the two forms, we take the indeterminates in the order 

X 2> •••> x n— ri— rj> x n— rj— rj+l> • • • s r*> *^n— rj+l> •• • > x n— 1* 

We see that for these indeterminates, 

n'=n- 2, ri = r x , r 2 = r 2 , 

and no preliminary transformation is necessary. Furthermore, a 
permissible transformation in these indeterminates induces a 
permissible transformation in ( x 0 , ...,x H ), (y 0 , ...,y n ). For the sub- 
matrix H which appears in the matrix of a permissible trans- 
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formation has only zeros above its principal diagonal, and therefore 
the equations of such a transformation are, as far as the displaced 
indeterminate x x is concerned, 

n-ri-r, 

£1= S OtZt + bx j. 

2 

Hence, when x x is replaced, and the additional equations 

£o = x o> 

L = x n 

are added, we have a permissible transformation of (x 0 , ...,x n ), 
(y 0 , •••>y n )- At this stage we have 

y'Bx = y 0 £, + yx 0 + Z p{a t , ef) + ZZpfef), 
y'Cx = y n x 0 -y 0 x n + Z y(<x { , ef) + ZZyfe,), 

where £, rj involve the same set of indeterminates as before the 
transformation, with different coefficients. For simplicity we 
preserve our previous notation, and write 

n-Ti 

y'Bx = y 0 Z+yx „+ £ b if y t x it 
2 

n— 1 

y’Cx = y n x 0 -y 0 x n + £' c ijyi x p 

n—ri n— 1 

where £ b ij y i x j and £' c ij y i x j are in canonical form. We now 
2 1 

proceed as in Case (ii&j), this time taking h = 1, and obtain 

71- fj 

y’Bx = y 0 x l + y 1 x 0 + £ b^x,, 

2 

71— 1 

y’Cx = y n x 0 -y 0 x n + S' 

l 

where the coefficients b i} , c^ are unchanged by these final transforma- 
tions. Hence we have 

y’Bx = y^ + y^o + Z p^efj + ZZp^ef), 
y'Cx = y n x 0 -y 0 x n + Z y(a t , ef) + ZZ y^ef). 

The indeterminates x v y x occur in 

Zy^eff+ZZy^ep, 

Z P(<%i, ef) + ZZ Pfcf). 


but not in 
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It follows that x v y x appear in some term On the other hand, 

x 0 , y 0 , x n , y n do not appear in either of these two expressions. 
Suppose that x v y x appear in 

Vi( e j) = (yi^ ai -y ai x 1 ) + (y at x aa -y at x at )+.... 

The corresponding 

At( e i) = (ya l Xa % J ry a% x a l ) + -> 

where a x , a 2 , ... are different from 0, \,n. Now, we may write 

(yo x i+yi x o)+(ya l x a t +y at *a 1 )+-~ = A(«*+i). 
iy n x o-yoXn)+(yiXa 1 -y ai x i )+ — = 7^+1), 
the terms in 1), Yi{e } + 1) being taken in the order 

x n> x 0> x l> X a x ’ x a%> •••• 

This completes the proof of Theorem II when r x > 0. 

We suppose now that r 0 = r x = 0, r 2 > 0. The analysis in this 
case is similar to that followed when r x > 0, and it will only be 
necessary to indicate the changes. After our preliminary trans- 
formation we have 

n n—r t 

y'Bx = '2,b„y i x j , y’Cx = X c^y^. 
o o 

We may assume that not all b in — 0, for in that case the hypothesis 
of induction would apply immediately. Hence the permissible 
transformations of § 2 give us either 

n— 1 

y'Bx = X b {j y { x f + ky n x n (k 4= 0), 

0 

n— r, 

y'Cx = X Cijyt x j, 

0 

71—1 

y'Bx — x' (*/ n **+*/**»), 

0 

n-r, 

y'Cx = X c# «/<**. 

0 

In the former case the permissible transformation 
x t = x t (i = 0, ...,»- 1), » n = 


and 

or 

and 
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enables us to take k — 1, and by applying the hypothesis of induo- 

n— 1 n— r t 

tion to 2 2 c^ytXp we finally obtain 

o o 

y'Bx = Z’^(a*,e i ) + ZT/? i (e i )+/? 8 (1), 
y'Cx = i:y(a i> e f ) + ri:y i (e / ) + y 3 (l). 

We now consider the second alternative: 

Case (i). k>n—r 2 . This is dealt with in exactly the same way as 
Case (i) above, interchanging the roles of B and C. We find that 

y'Bx = r/?(a,,e,)+rr^(e,)+A(l), 
y'Cx = 2'y(a i ,e i ) + r2:y i (e i ) + y 1 (l). 

Case (ii). If k^n — r 2 , the interchange of suffixes 0 and k is a 
permissible transformation, and we may write 

n—1 

y'Bx = 2 bijyiXj + iy^ + yvXj, 

n—r t 

y'Cx = £ c^xj. 
o 

The methods followed above, for r x > 0, still apply, interchanging 
the roles of B and C, but since C is skew-symmetric and therefore 
always of even rank, only Case (ii&) appears. Case (ii6 x ) gives us 

y'Bx = 1), 

y'Cx = 27y(o i ,e < ) + SEy i (e j ) + y 5 (l). 

Case (ii& 2 ) leads to a result exactly similar to Case (ii6 2 ) above. 

Finally, if r 0 = r x = r 2 = 0, we are in the simple case, and the 
reduction of § 4 applies. 

6. Canonical forms for correlations. Having completed the 
reduction of the forms y'Bx, y'Cx to the canonical forms 

y'Bx =£ *)+ 2 2 /?«(«}), 

i i= 1 j = i 

J/'Ca; = 2 r(«i. e<) + S S 7i(ef)> 

1 i~l i -1 

we arrange the indeterminates involved more conveniently so that 
the 2e i indeterminates involved in /?(a f , e t ) and y(a f , e*) are 

3'2(e 1 +...+ej_ l )> •••> a ' 2 (e 1 +...+e<)-l> 

Vt{ei +.. •••» ?/a(«i+...+ei)— X* 


and 
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We then order those in y<(fy) so that their suffixes exceed the 
suffixes of the indeterminates in /? a (e 6 ), y a (e b ) if a<i, or if b <j 
when a — i. We now consider the matrix of A = B + C. We have 
at once 



the diagonal submatrices being defined as follows: 

D e (a) is defined in the proof of Th. Ill, § 4. It is a 2e x 2e matrix, 
and D e (a) — AD'(a) has the elementary divisors (A — a) e , (A — a -1 ) e 
if a=# 0, and A e if a = 0. 

F^e) (i = 1, 5) is the matrix of the bilinear form 

&(e) + 7<(e). 


It is a simple matter to verify that these matrices are: 


(i) F,(e) = 


, ! : \ 



• • 
1 1 


1 

1 -1 
i 




/ 


The methods already used for determining elementary divisors 
show that F x (e) — \F[(e) has (A— 1)®, (A — 1 ) e as its elementary 
divisors. 



6. CANONICAL FORMS FOR CORRELATIONS 


413 


(ii) 


*«(«) - 



1 



1 



and F 2 (e) — \F' 2 (e) has the elementary divisors (A+ l) e , (A + l) c . 


(iii) 



and F 3 (e) — \F' 3 (e) has the one elementary divisor (A — l) 2 * -1 . 



and F t (e) — \F' t (e) has the elementary divisor (A+ l) 2 *. Finally, 
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and the (2e+ 1) x (2e+ 1) matrix F 5 (e) — XF' 5 (e) (A indeterminate) is 
of rank 2e. 

Now, if A — XA' is non-singular for indeterminate A, there are no 
zero matrices on the diagonal of ( 1 ), nor are there any F 5 (e) matrices. 
We can now give Th. Ill, § 4 in its complete form as 

Theorem I. Necessary and sufficient conditions that there exists a 
matrix A such that A — XA' is non-singular and has assigned elemen- 
tary divisors are : 

(i) the elementary divisors, other than those of type X e , (A — l)-'', 
(A+ 1)® can be arranged in pairs (A — a*)^, (A — af 1 )*; 

(ii) the elementary divisors (A — 1)^, for a given even integer f, are 
even in number; 

(iii) the elementary divisors (A + 1)®, for a given odd integer g, are 
even in number ; 

(iv) if the elementary divisors are A e », ..., A e «, (A — 1)A, (A — 1)A, 
(A+ 1)® 1 , (A+ 1)®«, (A-ajfi, (A— af x )\ ...,(A — a d ) A <*, (A-aj 1 )^, 
then 

2 2 e.j + 2/i + 2s , i + 22^f = w + 1. 
iii i 

The necessity of these conditions is an immediate consequence 
of the reduction to a canonical form and of the results given above, 
which give the elementary divisors of the various submatrices in 
the canonical matrix (1). The sufficiency of the conditions is evident, 
because we can construct a matrix A as given by (1), after grouping 
the elementary divisors into suitable pairs. 
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The canonical form (1) may be simplified to some extent. We 
observe that F x (e) — X.F' 1 (e) has the same elementary divisors 
(A — 1)*, (A — l) e as D„( 1). Hence, by § 4, Th. I, there exists a non- 
singular matrix P such that 

P'F 1 {e)P = D e {l). 


For similar reasons we can find non-singular matrices Q, R, 8 

such that n , „ . \ n . _ . 

QF 2 {e)Q = D e {-l), 


IT/F a ‘e) 

° \ 

R — ^2e-l(l)> 

\ o 

ml 

1 

S'(F t (e) 

0 ) 


\ o 

ml 

! 


Hence, by a further transformation of coordinates, we can assume 
that (1) contains no submatrices F x (e) or F a (e), and that for given 
/ f , g i there is at most one F z (e), and one F t (e). In future we shall 
suppose that this has been done. 

We now consider the case in which A —XA' is singular, for in- 
determinate A. In the canonical form of A the number of zero rows 
(and columns) is r 0 , where n — r 0 4- 1 is the rank of the matrix 

and therefore n — r 0 + 1 is also the rank of the matrix 



This rank is invariant under transformations of coordinates, since 

/ P'AP \ = / P'AP\ = P'tA\P. 

\(P'AP) 7 \ P'A'P] U'/ 

If the canonical matrix A contains k matrices F 5 (e), we see that the 
rank of A — A A' is n — r 0 +l — k. Hence k is also an invariant of A . 
We now prove that the e\ which appear in the submatrices F 5 (e f), 
and which determine the number of terms in y' Ax corresponding 
to each submatrix F h {e\), are also invariants of A. We assume 
that k > 0. 
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We consider the linearly dependent rows of the matrix 

(A-XA')x, 

writing „ n ~ r » , . 

X t = ^ {a ij -Aa ji )x j (t = 0 n-r 0 ). 

i= o 

These linear forms have k independent relations connecting them, 
say n _ r 

= 0 , ( 2 ) 

1 =o 

where we may assume that the d } { A) are polynomials in A. Amongst 
all possible relations of the form (2) we consider those in which the 
maximum of the degrees in A of the polynomials d 0 , d n _ ro is least. 
Let this smallest maximum be m v and suppose that there are q t 
relations (2) of this degree, say 

0i> 

We now consider all relations (2) which are not of the form 

i 

and amongst these we again consider those of lowest degree, which 
degree we denote by m 2 . If 

is a basis for this set of polynomials, we then consider those relations 
(2) of lowest degree which are not expressible in the form 

i =1 

and so on. We obtain a set of polynomials 

01, 

which form a basis for the relations (2). We prove that the integers 
q v ...,q p , m v ...,m p are invariants under allowable transformations 


of coordinates. If 


x = Px 


is such a transformation, the relation 


becomes 

where 


Za,( A)X,. = 0 

Zd i (X)q i jXj = 0 , 

Q = = P- 
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Since q i} is in K, the degree of any relation is unaltered by an 
allowable transformation. This proves the invariance of q v ...,q p , 
m lt We now associate these integers with the ef. 

In the canonical form (1) the only submatrices O such that 
G—AG' is singular are the k matrices F 6 (e\). Since all the sub- 
matrices are on the principal diagonal of A, we obtain relations (2) 
only from the F & (e\) matrices, and there is one relation corresponding 
to each matrix. Let F h (e)) have rows and columns corresponding 
to the indeterminates 


2e (® ®#)* 

Then X m = (l-X)x m+2e , 

m+1 = (1 **'m+2e— 1 "b ( 1 "b A) X m+ % e , 

^■m+2 = (1 A) %m+ 2e— 2 "b (1 "b A) ^ m +2e_u 

^m+e—1 = ( ^ A) •X'm+e+l "b ( 1 "b A) * m + e + 2 > 
m+e = (1 ”b A) % m +e+l> 

^m+e+1 ~ ( 1 — x m+e~ 1 — ( 1 + A) X m+e , 

m+2e = (1 A) ^ "b A) %m+V 

The relation of least degree connecting X m , X m+2e is 

= (1 + V e X m - (1 + A y- 1 (1 - A) X m+1 

— (1 + A) e_2 (l — A) 2 X ot+2 ... ± (1 — A) e X m+e = 0, 

and ^r x , fric form a basis for the relations (2). We deduce that if 
... then 

e\ = e| = ... = e\ x = m 1 < e® l+1 = ... = e® a = m 2 < ... 

= 4p = m P tip = k )> 

and therefore the integers ef , . . . , e| are invariants of A under allow- 
able transformations of coordinates. 

We sum up our results in 

Theorem II. Let A be any (»+ 1) x (n - b 1) matrix, let {A, A') be 
of rank n—r+ 1, let A — A A' be of rank n—r+l — k, and let 

fv->fk 

ar 


HPI 
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be a basis of ‘polynomials of minimum degree for the relations 
connecting the rows of (A—XA')x. Let the degree of xjr t be m { , and 
suppose that 

m x - ... = m Qi = n x <m Sl+1 = ... = m Qt = n 2 < ... 

= m Qp = n p (?p = *)• 

Let the elementary divisors of A— XA' be 

A%...,AS (A-l)Vi- 1 (A- 1) 2 * -1 , (A+l)«»i,... f (A+l )»*, 

(A— (A— .... (A— a d )\ (A — a^ 1 )^, where a t may be ± 1, 
andf 1 <f 2 <...<f b ,g 1 <g 2 <...<g c . 

Then 

2£e + B{2f — * 1) -f- 2Sg + 2 Bh — n — r + 1 — 

and <Aere a non-singular matrix P such that the correlation 

u = .da is transformed into the correlation v = By by the transforma- 
tion x = Py, where 

B = P'^P 



= k, and the zero matrix has r rows and columns. 
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7. Some geometrical properties of correlations. We begin 
with a simple correlation [§4], 

u = Ax, (1) 

this having been defined as one for which A+A' and A— A' are 
both non-singular. The canonical form is given by (1), where 


■®/ 1 ( a i) I 

‘ Df.iccJ 

A point x has no transform if it lies in the space given by 

Ax = 0. (2) 

Remembering that D e (a) is the 2e x 2e matrix 

“\ 

a 1 ' 

a 1 
• • 

a 1 , 

1 0 
0 

/ 

it is easily seen that the space given by (2) is of r — 1 dimensions, 
being spanned by the vertices 

^2ei+et> •••» -^2(«x+...+e r _i)+er> 

of the simplex of reference. This space may be called the funda- 
mental space of the correlation corresponding to the zero latent root 
of A — XA’ . Similar working shows us that the transform of any 
point which has a transform is a prime which always passes through 
the space spanned by 

*0. ^tei> •••> •‘^2<e 1 +...+«r-l)• 
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This is also a space of r— 1 dimensions which, we see at once, is 
the space defined by the equations 

A'x = 0. (3) 

We may regard this as the fundamental space corresponding to the 
infinite latent root of A — XA' . We note that the spaces defined by 
(2) and (3) do not meet. 

The space given by 

(A — oq-d.')# = 0, 

where is a latent root of A — X A', is called the fundamental space 
of the correlation corresponding to a v Recalling that the dual 
correlation is 

v = A'y, (4) 

we see that a necessary and sufficient condition that a point x 
should have the same transform under (1) and (4) is that it should 
lie in a fundamental space corresponding to a latent root of A — XA'. 
Let us consider the space defined by 


(A— oCj^A^x = 0, 

and assume that 

0 + oq = a 2 = ... = a ( , 


and 


a i + a i ( i>t )• 


We see that this space is an S t _ v defined by the vertices 


2Se+/x> ^2Xe+2 /,+/,» •••» ' A -2X«+2(/i+...+/ t _ 1 )+/< 


x 9 


The correlation (1) transforms the points of this into the primes 
which pass through the 8 n _ t given by the equations 


x 2U+fi-l — X 2£e+2/ x +/,-l ~ 0, •••> x 2X«+2(/ 1 +...+/ t _ 1 )+/i-l = 0. 

This S n _ ( contains the S t _ ly and also contains every fundamental 
space corresponding to a latent root of A—XA' distinct from af 1 . 
The fundamental space corresponding to af 1 is seen to be an 8?L 1 
spanned by the vertices 

X%£e> X 2 £e+2/,> '^2Xe+2</ 1 +/ t )> •••» -^2 Z’e+2(/ 1 +... +/<-»)• 
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Hence S n _f and $*_ x meet in a space which is spaimed by the 
vertices 


^2£e+2if 1 +...+fi)y -^2Xe+2 ifx+...+fi +1 )> •••> -^2 £e+2(/ 1 +... +/<_!>> 

where we have assumed that/j,/^ are so arranged that 


fi 


= fi = 1> 


and 




We now turn to the general correlation discussed in § 6. In the 
simple case discussed above, the possibility of latent roots equal to 
+ 1 was excluded. Such latent roots arise here. Before finding the 
fundamental space corresponding to A = 1 we remark that in dis- 
cussing the geometrical properties of any correlation we are led 
to consider certain loci: 

(i) the locus of points which he in their transforms under (1). 
These are the points whose coordinates x = (x 0 , ...,x n ) satisfy the 
equation *u-*Az- 0. 


This equation may also be written 

(x'Ax)' = x'A'x = 0, 

or x'Bx = 0, (6) 

where B — \{A + A') is the symmetric part of A. 

(ii) the locus of points which have the same transform under 
(1) and (4). These points satisfy the equations 

(A — aA')x = 0, (6) 

where a is any latent root of A— AA', and were considered in the 
simple case. If x is any solution of (6), 

x'Ax = ocx'A'x = ax' Ax. 

Hence, if a# 1, x ' Ax = 0 , 

and therefore every point which lies in a fundamental space corre- 
sponding to a latent root a 4= 1 satisfies equation (6). 

We assume that A is in the canonical form described in § 6, and 
suppose that a = 1 in equations (6). These become 

CX wm 0 , ( 7 ) 

where C is the skew-symmetric part of A. In solving these equations 
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we observe that the only submatrices of A which yield any non-zero 
coordinates are those submatrices 0, say, such that O—O' is a 
singular matrix. These are the submatrices 

A(l). W. 

It is therefore sufficient to consider the case in which 



By direct calculation we find that a basis for the solutions of (7) 
is given by the set of vertices 


Xo, X Pl , X^, x 2ei+ei , . .., -X _ 2 (f’ 1 + . . . +e a _ i)+fa points), 

Xz 2 e> X £2e + 2 f t -l> •••> X Sie + 2 (f 1 +...+f b _ 1 )-(b-l) points), 

X^re+iSf-by X 2Et+2 zf-b+2g 1 +\y •••> X 2 Xe+2 r/-6+2(ff l +...+0 c _ ,)+(c-l) 

(c points), 

X m , X m+l , ..., X n (m — 2Xe + 2Xf + 2Zg — 6 + c). 

The point P = X^X^ where the summation is over the suffixes 
contained in the above table, lies on the locus 

x' Ax = Xa ii x i x i = 0 

if and only if 

d'-l 

2(A 0 A 1 + ... + A 2d _ 2 A 2d _ 1 ) + 2 A| ei+ ... +eo ) +i = 0, 

where < "° 

e i = ••• = = 1, e d+1 > 1, and f t = ... = f d . = 1,U +1 > 1. 

A correlation may be singular for several reasons. If we consider 
the canonical form, we see that A is singular if 

(i) there are zero matrices on the principal diagonal; 

(ii) there are submatrices D e ( 0) on the principal diagonal; 

(iii) there are submatrices F 5 (e) present; and 

(iv) if there is a combination of (i), (ii) and (iii). 
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If a correlation is singular because of (i) only, we shall say that 
it is of the first kind. If it is singular because of (ii) and (iii) only, 
we shall say that it is of the second kind. 

If a correlation is singular on account of (i), by itself, or on account 
of (iv), the canonical form is 



where A x is an (m+ 1) x (m+ 1) matrix such that the correlation 

u — A t x (8) 

in the 8 m whose equations are 

x m+l 0 

is either non-singular, or singular only because of (ii) and (iii) 
above. If <Sf n m t is the space 

*o = ••• ™ ~ 

no point of this space has a transform under the given correlation 
in [»], but the transform of any other point, if it exists, passes 
through S n _ m _ v We thus obtain the following construction for the 
given correlation in [n]; if P is any point not in S n m 1? project P 
from into the point P' of S m . Let IT be the transform of 

P' by the correlation (8) in S m , and let 77 be the prime joining 77' to 
S n _ m -v Then 77 is the transform of P by the given correlation 
in [»]. 

From this result it follows that the geometry of any correlation 
in [n] can be deduced from the geometry of a correlation of the 
second kind. The properties . of a singular correlation of the second 
kind are most easily discussed by showing that such a correlation 
can be represented as the product of a non-singular collineation 
(of a kind which is, geometrically, particularly simple) and a 
correlation (in the same space) of the first kind. We do this below. 
Ultimately, therefore, the properties of singular correlations can be 
made to depend on those of non-singular correlations in a subspace. 

Let u = Ax 

be a correlation of the second kind, where we suppose that A is 
in canonical form and, to simplify our notation, we omit the 
submatrices D e {a) (a + 0), F 3 (e), F t (e). Let 7)^(0), ...,7)^(0) be the 
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diagonal submatrices corresponding to the latent root A = 0, and 
lot 

FM, 

be the diagonal submatrices of type F 5 (e). The space of points having 
no transform in the correlation is defined by the equations 

Ax — 0. 


By direct calculation this space is seen to be the S l+C _ x 
by the t points 


Y Y 

-^■2e 1 +ea> 


•» ^2iei+...+et-i)+et> 


spanned 


and the c points whose coordinates are of the form 


( 0 , 0 , 


..., 0 , 1 , 1 , 1 , 0 , 

' , . 

0i + 1 


0 ). 


Each one of these c points arises from a submatrix F§(gJ), and the 
non-zero coordinates are g i + 1 in number, their position corre- 
sponding to that of the first g i + 1 rows of F^g^ in the matrix A. 

Again, the primes which correspond to points in [ft] with a 
transform all pass through the space spanned by the t points 


*0. -^2ei> -^2(e l +.. .+«<_!>> 

and the c points 

( 0 , 0 ,..., 0 , 1 , - 1 , 1 , - 1 ,..., ± 1 , 0 ,..., 0 ). 


0i+ 1 


Here also, each one of these c points arises from a submatrix F^gJ, 
and the non-zero coordinates, which alternate in sign, are g i + 1 in 
number, their position corresponding to that of the first g i + 1 rows 
of F h {g>) in the matrix A. This space is evidently defined by 

the equations A'x — 0 


It is immediately verified that ^ <+c _ 1 and do not meet. This 

may also be seen from the fact that if the equations 

Ax = 0 and A'x = 0 


have a common solution, the integer r 0 introduced into our discus- 
sion, p. 416, of the canonical form is not zero, and therefore zero 
matrices appear on the principal diagonal of A, contradicting our 
hypothesis that the correlation is of the second kind. 
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When we consider the points which have the same transform 
in the dual correlation as in the given one, we have only to 
concern ourselves with the subm'atrices F h (g^), ...,F 5 (g c ), since 
D e (0) — AD'(O) is non-singular if A 4= 0. Now 

[F 6 ( gi )-\F' 5 ( gi )] x = 0 

has a solution for each value of A. We see that all such solutions lie 
in the space A 

x 0 — X 1 ~ ••• — x 2(ei+...+et)-l ~ 

X 2 Xe+gi+1 = ••• = X 2Ze+2gi = ( 10 ) 


x 2Ze+2f.0 1 +...+0c-i>+(c-\)+0c+\ ~ ••• — *2 Ze+2 Zg+c—1 _ 0, 

where 227e + 227<7 + c— 1 = n+ 1. 

Now consider the collineation 


«*». 

II 

: *** 
JL 

ii 

© 

y e l 

(1M) 

Ve x +i — x i ) 

y%e x +i — X 2e x +e t +<] ^ _ q 

y •••y &2 

(11*2) 

Vle x +e 2 +i == X 2e x +i ) 

• • 

y2(ci+...+c/_i)+i x 2&ei+...+et-i)+et+i 1 

V2(e 1 +...+et- 1 )+et+i ~ X 2{e x +...+et-d+i ) 

II 

© 

l 

-l), (li e) 

V 2(ej+. . . +et)+2i ~ X 2(ei+...+^)+2i> i 

y2(e l +...+e t )+2i+l == “ x 2(e 1 +...+e l )+2i-bV J 


(n-n 


where in (1 1-f') the suffix i varies so that the equations range over 
all the remaining coordinates. This collineation is evidently an 
involutory collineation. If we write it in matrix form as 

V = Cx, (11) 

then C 2 = I n+V 

It transforms into <$£+<_ i, and leaves the space defined in 

(10) invariant. Now consider the correlation 

u = ACy. (12) 

Since u — {AC) ( Cx ) = ACy = Ax, 

our given correlation of the second kind is now expressed as the 
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product of the non-singular collineation (11) and the correlation (12). 
We show that the correlation (12) is of the first kind. In fact, actual 
multiplication of the matrices involved shows that the matrix AC 
vet symmetric, and therefore the correlation (12) is necessarily of the 
first kind. The vertex of the primes u in the correlation (12) is 
since the solutions of 

{AC)'y = 0 

are the solutions of . , _ 

A'y = 0 , 


the matrix C being non-singular. 

We conclude with some remarks on non-singular correlations. 
These are most easily discussed by means of the associated 


collineation 


y = AAx. 


(13) 


The points, lines, etc., which have the same transform under (1) 
and the dual correlation are precisely the united points, lines, etc., 
of (13), and we have discussed the methods of obtaining these in 
Chapter VIII. We recall the theorem established in § 4, that the 
elementary divisors of AA— A/ n+1 , which are those of A— AA', 
cannot be chosen arbitrarily, but if A = a (a#= ± 1) is a latent root, 
so is A = a -1 . Applying VIII, § 3, Th. V, gives us an interesting 
geometrical result. Let x be any point of [n\ which is not a united 
point, and y its transform under (13). The line xy meets all the 
fundamental spaces of the collineation (13), the point of intersection 
with the fundamental space corresponding to the latent root a being 


A a = y-ax. 

We see that the points 

A a > -A a - 1 

are in involution, and if A = ± 1 are latent roots of the collineation, 
the points A v A_ x are the united points of the involution. 

As we have already seen, the correlation u = Ax, when A is 
non-singular, gives a prime-point transformation 

y = Av. (14) 


For if x lies in the prime v, u passes through the point y, where 

u’y = x' A'y = x'v. 

We oan develop a similar theory for this prime-point correlation, 
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which is, indeed, the same correlation as u = Ax. In particular, 
the correlation associated with (14) is 

w = AAv, 

and since {A A) — AA, 


this is the dual form of (13). 

Finally, some aspects of the elementary theory of quadrics are 
of interest in connection with the theory of correlations. If the 
prime u which is the transform of the point x contains the point x, 
this point satisfies the equation 


or, equivalently, 


‘ x'u = x 1 Ax = 0, 
x’Bx = 0, 


(15) 


where B = £(.4 +.4'). 


We call the locus of points given by (15) the incidence quadric 
associated with the correlation u = Ax. Similarly, the primes v 
which contain their transforms under (14) satisfy the equation 

v'Dv = 0, (16) 

where D = \{A + 4 _1 ). 

We see that B = A' DA. 


Let us assume that the correlation u = Ax, besides being non- 
singular, is such that B and D are non-singular. The point-equation 
of the locus (16) is , n , _ n 


For this to coincide with the incidence quadric (15) we must have 

JcD- 1 = B 

for some k in K. We determine the simple non-singular correlations 
for which the ‘envelope’ incidence quadric (16) is the dual of the 
incidence quadric (16). 

The above condition is 

4fcJ n+ x = {A-' + l){A + A’) 

= 2/ n+1 + A A + A _1 A 
AA + (AA)-i = 2(2k-l)I n+1 . 

Y = AA. 


or 

Let 
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Our last condition is 


F 2 - 2(2fc-l)F + J n+1 = 0. 

Hence, since F 4=AJ ra+1 , the minimum function of the matrix F is 
[cf. II, § 10] 

^(A) = A 2 — 2(2fc— 1) A+ 1. 

By II, § 10, Th. Ill, the last invariant factor of F— A/ n+1 is A), 
and therefore all invariant factors divide r]r{ A). We consider the 
following possibilities: 

(i) 2k — 1 =1= + 1. If a, a -1 are the roots of ^(A) = 0, neither of 
these roots is equal to ± 1 . The elementary divisors of F — A/ n+1 , 
as we remarked in §4, occur in the pairs (A — a)\ (A — a _1 ) e ». Since 
the product of all the elementary divisors is of degree n + 1 , it follows 
that » + 1 is even. Finally, since the last invariant factor is simply 
(A — a) (A — a -1 ), the elementary divisors of F — A/ n+1 consist of 
\(n+ 1) elementary divisors (A — a), and an equal number of 
elementary divisors (A — a -1 ). Hence A can be reduced to the form 

A = /D x {cc) 

I D x {a) 

\ D x (a) 

where, in the usual notation, 

D x (a) = /0 a\ 

\i or 

Conversely, it is easily verified that if A is taken in this form, with 
any a which is neither zero nor + 1, then 

kD~ 1 = B (A: 4=0), 

where 7 _ (a+ 1) 2 

* ~ 4a * 

(ii) 2k— 1 = 1. The minimum function is now (A — l) 2 , and the 
elementary divisors of F— A/ ra+1 are all of the form (A— 1) or 
(A— l) 2 . Since, by §6, Th. I, there must be an even number of 
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elementary divisors of the latter type, A can be reduced to the 
form 



where [§6, (i)] 

11 ( 2 ) = /0 0 0 1 \ 

1 0 0 1 l\ 

lo i o or 

\l — 1 0 0/ 

We may again verify that if A is given in this form it satisfies all 
our requirements. 

(iii) 2k — 1 = — 1. The minimum function is (A + 1) 2 , and A = — 1 
is a latent root of the matrix Y. It follows that 

B = \(A+A') 

is singular, contrary to our hypothesis. 



BIBLIOGRAPHICAL NOTES 

Book I 

A considerable part of the material of Chapters i-iv will be found in 
works on Modem Algebra, such as Albert (1), Dickson (5), Macaulay (9) and 
van der Waerden (10). As far as we know, the development of the theory 
of linear dependence over a non-commutative field, given in Chapter n, is 
new, and the algebraic theory of Jacobians given in § 7 of Chapter hi has 
not previously been published. In Chapter iv we have preferred to use the 
term resultant-forms instead of the literal translation inertiaforms of the 
Tragheitsformen introduced by Hurwitz. 

Book II 

Chapter v. The method given here for defining a projective space has 
been used by Lefschetz (8) for the case of a commutative field, and is 
derived from Veblen and Whitehead (11). The development for a non- 
commutative field given here has not previously been published. 

Chapter vi. Much has been written on the axiomatic foundations of 
projective geometry. For a full account of these writings, the reader is 
referred to the article by Enriques (6) and to Baker (2). The method followed 
here bears most resemblance to that adopted by Veblen and Young (12), 
but differs from their treatment in the choice of the initial axioms, and in 
the introduction of coordinate systems before the imposition of limitations 
which effectively restrict the field from which the coordinates are taken. 
The development in Chapter n of the theory of linear dependence over a 
non-commutative field was designed to make this treatment possible. 

Chapter vn. The use of coordinates to distinguish linear spaces originated 
with Grassmann and was developed into line-geometry by Pliicker. Refer- 
ences will be found in the article by Segre (7), and an account is given in 
Bertini (3). Properties of determinants which are in effect Grassmann co- 
ordinates play an important part in the algebraic theory of invariants. 
The basis theorem of § 7 was first proved by Mertens. Our proof is similar 
to that given by Weitzenbock [ Proc . K, Akad. Wetensch. Amsterdam, 
vol. xxxix, p. 503 (1936)]. 

Chapter vm. The geometrical theory of collineations has been fully dis- 
cussed by Segre. For references see Segre (7). An account is given in 
Bertini (3). The treatment given here is more algebraic than either of these. 

Chapter ix. The account of correlations given here is based on the 
simultaneous consideration of symmetric and skew-symmetric bilinear 
forms, which derives from the treatment of two quadratic forms given in 
Bromwich ( 4 ), who bases his work on that of Kroneeker. For the geometrical 
aspects reference should be made to Segre (7). 
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(The numbers refer to the page where the term occurs for the first time in the 

book or is defined) 


Abelian or commutative group, 3 
of operations on a matrix, 73 
Addition of points on a line, 253 
Additive group, 3 
Adjugate matrix, 81 
Adjunction of algebraic element, 103 
of indeterminate, 19 
Algebra of points on a line, 253 
Algebraic closure of a field, 96, 112 
dependence, 108 
element of extension field, 102 
Algebraic extension over a field, .106 
primitive element of, 126 
simple, 101 

Algebraic functions, 107 
differentiation of, 128 
Algebraic function field, 107 
independence, 108 
point of projective space, 179 
Algebraically closed field, 96, 112 
Allowable coordinate systems, 178 
Allowable transformations, 59 
of restricted type, 367 
Alternating group, 5 
Apolar vectors, 69 
Associated collineation, 365 
Associated coordinate systems, 186 
Associative law of composition, 2 
for multiplication of matrices, 52 

Basis, finite algebraic, 108 * 

for extensions of finite degree, 105 
for ideal, 142 
for linear set, 45 
Basis for linear space, 183 

derived from Grassmann coordinates, 
300 

in incidence space, 209 
Basis, minimal algebraic, 108 
Basis theorem for Grassmann co- 
ordinates, 315 
theorem, Hilbert’s, 142 
Bilinear correspondence, 365 
form, 365 
Binary form, 148 

forms, resultant of two, 150 
Block matrices, 53 


Canonical form for 
collineations, 331 
correlations, 411 
A -matrices, 88, 92 
matrices, 60, 62 
null -polarity, 380 
polarity, 372 

projective transformation, 323 
skew -symmetric bilinear form, 378, 
380 

symmetric bilinear form, 368, 372 
Chain of perspectivities or projectivity, 
217 

Characteristic equation of matrix, 95 
Characteristic function of matrix, 95 
Characteristic of ring, 14 
Characteristic matrix, 95 

polynomial of algebraic element, 
102 

Coefficients of polynomial, 20 
Cofactor of element in determinant, 77 
Collinear points, 191 
Collineations, 327 

as product of null-polarities, 383 
as product of null -polarity and 
polarity, 389 

as product of polarities, 376 
associated with correlation, 365 
canonical form, 331, 332 
cyclic, 352 

fundamental space corresponding to 
latent root, 336 
in [3], 350 

induced in linear space, 342, 343 
non-special, 336 

particular, with only one fundamental 
space, 356 

Segre symbol for, 350 
similar, 328 
singular, 358 
special case, 336 
united points of, 334 
with only one united point, 344 
Column of matrix as right-hand vector, 
66 

matrix printed as row matrix, 50 
rank of matrix, 66 
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Commutative or Abelian group, 3 
Complement of matrix, 72 
Complete faetorisation of element in 
integral domain, 29 
<<&mpletely reducible polynomial, 112 
Compound matrix, 291 
Concurrent lines, 191 
Condition for commutative field, 202 
Conjugate points in polarity, 367 
spaces in polarity, 375 
Consequences of assuming Pappus* 
Theorem, 274 

Construction for O -f U(oc -f- fi), 197 
for 0+ Uafiy 199 

Coordinate of point in a scale, 259 
Coordinate systems, allowable, 178 
associated, 186 
related, 283 
scale, 283 

Coordinates, Grassmann, of linear space, 
289 

of a projected space, 308 
Coordinates of point in a reference 
system, 262 

set of, for point in projective space, 
178 

Correlation between distinct spaces, 362 
Correlation, canonical forms for 
general correlation, 418 
null-polarity, 380 
polarity, 372 
simple correlation, 394 
Correlation, 

exceptional points, 362 
geometrical loci connected with, 421 
in [n], 364 
non- singular, 426 

permissible transformations for re- 
duction of, 399 

simple, geometrical properties of, 419 
singular, 422 

singular of first and second kind, 423 
Cyclio collineations, 352 

Decomposition of element into factors, 
28 

Degree of extension of finite degree, 105 
of a polynomial, 20 

Delta symbols used in determinant 
theory, 74 

Dependence, algebraic, 108 
linear, 43 

Derivative of a polynomial, 119 
of a rational function, 120 
Desargues* Theorem, 191 

assuming Pappus* Theorem, 272 


converse, assuming the theorem, 196 
general case, 194 
in an incidence space, 213 
Determinant, 

cofactor of element in, 77 
effect of interchange of rows, 80 
Laplace’s expansion of, 75 
of matrix product, 78 
of skew matrix, 80 
Determinants, 73 
value of certain, 79 
Diagonal points of a quadrangle, 244 
Diagonals of a quadrangle, 244 
Differentiation of algebraic functions, 
128 

of polynomials, 119 

Dimension of extension over a field, 
110 

of a linear set, 45 
Displacement on a line, 356 
Distributive law for elements in a ring, 6 
for multiplication of matrices, 52 
Division algorithm, 24 
Divisor of a polynomial, 25 

of zero, right-hand, left-hand, 13 
Dual correlation, 365 

Grassmann coordinates, 292 
projective transformation, 323 
relationship between solutions of 
linear equations, 70 
spaces, 188 
theorems, 190 
Duality, Principle of, 190 

Elementary divisors of A-matrix, 92 
Elementary transformations, 63 
on A-matrices, 88 
Envelope incidence-quadric, 427 
Epsilon symbols used in determinant 
theory, 73 

Equation of projectivity on a line, 281 
Equations, homogeneous linear, 68 
solution by determinants, 84 
Equations in several unknowns, 159 
non -homogeneous linear, 85 
of a linear space, 188 
Equivalence relation, 10 
Equivalent extensions of a commuta- 
tive field, 101 
factorisations, 29 
A-matrices, 88 
(n+ l)-tuples, 176 
primes in a u.f.d., 30 
rings, 10 

solutions of homogeneous equa- 
tions, 140 
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Even permutation, 5 
Exceptional points of a correlation, 362 
of a singular projective transforma- 
tion, 322 

Exchange theorem 
for linear sets, 46 
generalised form, 108 
in projective space, 182 
Extensions, equivalent, of a commuta- 
tive field, 101 

Extensions of a commutative field, 101 
of finite degree, 106 
of projective number space, 178 
of projective space, 179 

Factor of element in integral domain, 28 
Factorisation, complete, 29 
in an integral domain, 27 
in polynomial rings, 33 
of polynomials, 111 
Factorisations, equivalent, 29 
Field, 13 

algebraic closure, 96, 112 
algebraically closed, 96, 112 
commutative, 13 
example of non-commutative, 39 
formed by points on a line, 268 
ground, 41 

Field obtained by adjunction of alge- 
braic element, 103 
of transcendental element, 104 
Field of characteristic p, 37 

of coefficients of set of equations, 146 
Field of geometry algebraically closed, 
274 

commutative, 272 
in an incidence space, 269 
without characteristic, 273 
Field of rational functions of #, 38 
of x l9 ...» x r , 39 

Field, root, of a polynomial, 114 
Fields, examples of, 37 
Finite algebraic basis, 108 
Finite geometries, 206 
Form, 140 
binary, 148 

Forms, canonical, see Canonical form 
Fundamental projectivities on a line, 
278 

Fundamental space corresponding to 
latent root of collineation, 336 
to infinite latent root of correlation, 
420 

to zero latent root of correlation, 419 
Fundamental space, rule for its deter- 
mination, 336 


spaces, their join, 336 
theorem of algebra, 112 

General correlation, 396 
Generic linear space, 316 
Geometrical loci connected with corre- 
lations, 421 

properties of a simple correlation, 
419 

Grassmann coordinates, 288 
basis theorem, 316 
determine basis for linear space, 300 
dual, 293 

elementary properties, 297 
intersections and joins, 304 
linear space with given, 312 
of linear space, 289 
of projected space, 308 
quadratic p-relations, 309 
relation between, and dual, 294 
three-term p-relations, 311 
Ground field, 41 
Group, 2 
additive, 3 
alternating, 5 
commutative or Abelian, 3 
inverse of element in, 4 
non-commutative, 3 
of operations on a matrix, 73 
of projective transformations, 177 
subgroup, 6 
symmetric, 6 
unity of, 4 
zero of, 4 

Harmonic conjugates, 224 
special case, 229 

Harmonically conjugate pairs, 230 
Highest common divisor of two poly- 
nomials, 26 

factor of elements of integral 
domain, 31 

Hilbert’s basis theorem, 142 
zero -theorem, 166 
Homogeneous equations, 140 
linear equations, 68 
Homographic ranges, 217 
Homology, 366 
involutory, 366 

Ideal, 11 

basis for, 142 

defines equivalence relation in rings, 
12 

improper, 12 
prime, 168 
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Ideal (cont.) 
resultant, 166 
right-hand, left-hand, 11 
unit, 12 

Ide&ls associated with sets of equations, 
140 

Improper ideal, 12 
Incidence; Propositions of, 186, 208 
Incidence quadric of correlation, 427 
Incidence space of n dimensions, 211 
as a P l n (K), 272 
as a P r n (K), 259 
Independence, algebraic, 108 
linear, 43 

Independent indeterminates over a 
held, 106 

Indeterminate, adjunction of, 19 
Indeterminate coefficients, system of 
equations with, 146 
Indeterminates, independent over a 
field, 106 

Infinity, properties of, 280 
Integral domain, 13 
indeterminate over, 21 
quotient field of, 16 
Intermediary line in projectivity, 218 
Intersection of linear spaces, 189 
in an incidence space, 211 
in Grassmann coordinates, 304 
Invariant factors of A -matrix, 92 
Invariant of null-polarity, 380 
of polarity, 372 
Inverse matrix, 66, 81 
of element in a group, 4 
Involution on a line, 352 
Involutory homology, 356 

particular collineation, 355 
transformation, 352 
Irreducible polynomial, 27 
Isobaric, 155 

Isomorphism between rings, 10 
Isomorphism, skew, 176 

Jacobian, 133 
matrix, 133 

Jaoobi’s theorem on determinant of 
submatrix of adjugate matrix, 
82 

Join of fundamental spaces in a colli- 
neation, 336 

Join of linear spaces, 189 
in an incidence space, 211 
in Grassmann coordinates, 306 

Kronecker delta, 54 


A -matrices, canonical forms for, 88, 92 
elementary transformations on, 88 
equivalence of, 88 
A -matrix, 86 

elementary divisors of, 92 
invariant factors of, 92 
Laplace's expansion of determinant, 75 
Latent roots of a matrix, 95 
Law of composition, 2 
Leading term of polynomial, 142 
Left-hand projective space or P^(X), 
185 

Left-hand vector, 48 
row of matrix as, 56 
Line, united, containing only one 
united point, 350 

meeting vertices of fundamental 
stars, 342 

Linear complex, 381 
dependence, 43 
independence, 43 
Linear dependence, in P r n (K)> 180 
of one point on others in projective 
space, 182 

of points in an incidence space, 208, 
268 

Linear homogeneous equations, 68 
manifold of solutions, 69 
right-hand, 69 

solution by determinants, 84 
Linear non -homogeneous equations, 85 
Linear set, 41 
basis for, 45 
dimension of, 45 
left-hand, 41 
minimal basis for, 45 
right-hand, 42 
Linear space, 183, 208 
basis for, 183 
equations of, 188 
generic, 315 

with given Grassmann coordinates, 
312 

Linear spaces, intersection of, 189, 211 
join of, 189, 211 

Lying in, relation between linear spaces, 
184 

Manifold of solutions of linear homo- 
geneous equations, 69 
Matrices, 49 

over a commutative field, 71 
postmultiplication of, 51 
premultiplication of, 5 1 
product of, 51 
ring of square, 7, 54 
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Matrices (cont.) 
square, 54 
sum of, 50 
Matrix, 

adjugate, 81 
characteristic, 95 
characteristic function of, 95 
column printed as row, 50 
complement of, 72 
compound, 291 
determinant of, 74 
determinant of product, 78 
division into blocks, 53 
Jacobian, 133 
product, 51 
rank, 65 
regular, 55 
ring, 54 

rows as vectors, 60 
singular, 72 
skew -symmetric, 86 
square root of, 96 
submatrix, 52 
symmetric, 86 
transformations of, 58 
transpose of, 66 
transpose of product, 72 
zero, 51 

Minimal algebraic basis, 108 
basis of linear set, 45 
Minimum function of matrix, 96 
Minus sign, multiplicative property, 10 
Multiplication of matrices, associative 
law, 52 

distributive law, 52 
Multiplication of points on a line, 256 

(n-f l)-tuple, 176 
(n-f 1) -tuples, equivalent, 176 
Negative of element in additive group, 4 
Non-commutative field, 39 
group, 3 

Non-Desarguesian geometry, 213 
Non-homogeneous equations, 162 
linear equations, 85 
Non-singular correlations, 426 

projective transformations, 322 
Non-special collineation, 336 
Null -polarity, 366, 378 
canonical form for, 380 
invariant of, 380 

Odd permutation, 5 
One-to-one correspondence, 10 
defined by perspectivity, 216 
Order of square matrix, 362 
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Pn(K) or left-hand projeotive space of 
dimension n over K , 185 
P r n (K) or right-hand projeotive space of 
dimension n over K , 178 
Pappus* theorem, 203 
consequences of, 274 
Partial derivatives, 121 
Particular collineations, 354 

with only one fundamental space, 
356 

Particular involutory collineation, 355 
Pencil of lines, 216 

Permissible transformations for re- 
duction of correlation, 399 
Permutation, even, odd, 5 
Perspective ranges, 216 
Perspective, triangles in, 192 
PN l n (K) or left-hand projective number 
space of dimension n over K, 
176 

PN^(K) or right-hand projective num- 
ber space of dimension n over K , 
176 

Point in projective space, 178 
algebraic, 179 
rational, 179 

symbolic notation for, 181 
transcendental, 179 
Point triad, 245 
Polarities, 367 
Polarity, 366 

canonical form, 368, 372 
conjugate points in, 367 
conjugate spaces in, 375 
invariant of, 372 
vertex of, 374 
Polynomial, 20 

characteristic, of algebraic element, 
102 

coefficients of, 20 
completely reducible, 112 
degree of, 20 
derivative of, 119 
divisor of, 25 
irreducible, 27 
leading term of, 142 
primitive, 33 
Polynomial rings, 19 
Polynomial, root of, 114 
root field of, 114 

symmetric functions of roots of, 115 
total degree of, 115 
zero of, 114 

Polynomials, in a non-commutative 
ring, 23 
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Polynomials ( cont .) 
relatively prime, 20 
specialisation of, 22 
Taylor’s theorem for, 122 
Postmultiplication of matrices, 51 
Premultiplication of matrices, 51 
Prime or S n _ l9 211 
Prime ideal, 168 

element of integral domain, 28 
Primes, equivalent, in integral domain, 
30 

star of, in projective space, 339 
Primitive element of algebraic exten- 
sion, 126 
polynomial, 33 
Principle of duality, 190 
Product of matrices, 51 

of projective transformations, 177 
Projective correspondence between 
points and spaces, 326 
Projective number space, 176 
extension of, 178 
Projective ranges, 217 
Projective space of n dimensions, 178 
point in, 178 

Projective transformation, 322 
canonical form, 323 
dual, 323 

exceptional points of, 322 
non-singular, 322 
rank of, 323 
singular, 322 
singular space of, 325 
vertex of, 325 

Projective transformations, 177, 322 
group of, 177 

Projectively identical linear spaces, 
186 

invariant constructions, 231, 238 
Projectivities, fundamental, on a line, 
278 

Projectivity or chain of perspectivities, 
217 

equation of, on a line, 281 
Proper divisor of a polynomial, 25 
Properties of infinity, 280 
Propositions of incidence, 185, 208 

Quadrangle, 244 
Quadrangular set of points, 245 
Quadratic p- relations for Grassmann 
coordinates, 309 
alternative form, 312 
Quadric, 375 
envelope incidence, 427 
incidence, of correlation, 427 


Quaternions, 40 

Quotient field of integral domain, 15 

r-way spaces, 206 
Range of points, 216 
Ranges, homographically related, 217 
in perspective, 216 
projectively related, 217 
Rank of a matrix, 65 
column-rank, 66 

connection with determinants in 
commutative case, 84 
row- rank, 60 

Rank of projective transformation, 323 
Rational point of projective space, 179 
Reference system in incidence space, 
249 

Regular element, or unit, of ring, 13 
matrix, 55 

Related coordinate systems, 283 
ranges, 216 

Relation of one linear space lying in 
another, 184 

Relatively prime polynomials, 26 
Remainder theorem, 25 
Representation of incidence space as a 
P r n (K), 259 

Restricted type of allowable trans- 
formation, 367 

Restrictions on the geometry in an 
incidence space, 272 
Resultant-forms, 166 
Resultant ideal, 166 

for set of binary equations, 156 
of (n*f 1) homogeneous equations in 
(n-fl) unknowns, 170 
of two binary forms, 150 
system for equations in several un- 
knowns, 159 

Resultant, properties of, 152 
Right-hand linear homogeneous equa- 
tions, 69 

Right-hand vectors, 48, 65 
columns of matrix as, 66 
Ring, 6 

characteristic of, 14 
condition for field, 13 
condition for integral domain, 13 
not a u.f.d., 29 

of coefficients of set of equations, 146 
of square matrices, 54 
polynomial, 19 

regular element, or unit, of, 13 
unit, or regular element of, 13 
unity of, 12 

without characteristic, 14 
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Rings, 6 

classification of, 12 
equivalence between, 10 
extension of, 1 1 
isomorphic, 10 f 
subrings, 11 

Root of polynomial, 114 

field of a polynomial, 114 
Roots, latent, of a matrix, 95 
Row-rank of a matrix, 60 
Rows of matrix as vectors, 56 
Rule for determining fundamental 
space of collineation, 336 
vertex of united primes of dual 
collineation, 339 

Scale coordinate systems, 283 
Scale on a line, 253 
Segre symbol for collineations, 350 
for singular collineations, 361 
Self-conjugate points in a polarity, 
367 

S k in a null-polarity, 382 
Set, linear, 41 

Set of coordinates of point in projective 
space, 178 

Signed minor or cofactor, 77 
Similar collineations, 328 
Simple algebraic extensions, 99 
Simple correlations, 390 
geometrical properties, 419 
Simplex in an incidence space, 211 
of reference in projective space, 
182 

Singular collineations, 358 
Segre symbol, 361 
Singular collineations in [3], 361 
Segre symbol, 361 
Singular correlations, 422 
of first and second kind, 423 
Singular matrix, 72 

projective transformation, 322 
space of a projective transforma- 
tion, 325 

Skew-isomorphism, 175 
Skew-symmetric bilinear form, 366 
canonical form, 378, 380 
Skew-symmetric matrix, 86 
Solution of linear homogeneous equa- 
tions, 68 

by determinants in the commutative 
case, 84 

-Solutions of a set of equations, 139 
Space, incidence, 211 
linear, 183, 208 

of points with no transform, 358 1 


projective, 178 
. projective number, 170 
Special case of collineation, 336 
of harmonic conjugacy, 229 
Specialisation of polynomials, 22 
Square matrices, 54 

root of a matrix, 90 
Star of primes, 339 
vertex of, 339 
Subgroup, 5 
Submatrix, 52 
Subring, 11 

Symbolic notation for points in pro- 
jective space, 181 
Symmetric bilinear form, 366 
canonical form, 368, 372 
Symmetric functions of roots of a 
polynomial, 115 
group, 5 

integral function, 115 
Symmetric matrix, 86 
Symmetric matrix, lemma on rank, 403 
System of equations with indeter- 
minate coefficients, 146 
System of reference in incidence space, 
249 

Taylor’s theorem for polynomials, 122 
Three -term relation for Grassmann co- 
ordinates, 311 

Total degree of a polynomial, 115 
Transcendental element of extension 
field, 102 

extension of a field, 104 
point of projective space, 179 
Transform of set of vectors, 49 
Transformations, allowable, 59 
elementary, 63 
of general correlation, 395 
of matrix, 58 
projective, 177, 322 
Transpose of matrix, 66 
determinant of, 74 
Transpose of matrix product, 72 
Transposition permutation, 6 
Triangle, 191 
Triangle triad, 245 
Triangles in perspective, 192 
Trivial decomposition into factors, 28 

U.f.d. or unique factorisation domain, 29 
Unit or regular element of ring, 13 
Unit ideal, 12 

United &-spaoes of a collineation, 344 
lines of [n], 348 
points of a collineation, 334 
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Unity of a group, 4 
of a matrix ring, 8, 54 
of a ring, 12 
.Unknowns, 139 

of linear homogeneous equations, 68 
u-resultant of system of equations, 
171 

Vectors, apolar, 69 

given by columns of matrix, 66 
given by rows of matrix, 56 
right-hand, left-hand, 48 


Vertex of a polarity, 374 

of a projective transformation, 325 
of a star of primes, 339 

Weight of isobaric^polynomial, 155 

Zero, divisor of, 13 
matrix, 51 
of a group, 4 
of a matrix ring, 7 
of a polynoniial, 114 
-theorem, Hilbert’s, 165 



